Overview
Brought to you by YData
Dataset statistics
| Number of variables | 42 |
|---|---|
| Number of observations | 98879 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 32 |
| Duplicate rows (%) | < 0.1% |
| Total size in memory | 203.0 MiB |
| Average record size in memory | 2.1 KiB |
Variable types
| Numeric | 8 |
|---|---|
| Categorical | 33 |
| Text | 1 |
| Dataset has 32 (< 0.1%) duplicate rows | Duplicates |
age is highly overall correlated with detailed_household_and_family_stat and 2 other fields | High correlation |
citizenship is highly overall correlated with country_of_birth_father and 2 other fields | High correlation |
class_of_worker is highly overall correlated with detailed_industry_recode and 4 other fields | High correlation |
country_of_birth_father is highly overall correlated with citizenship and 3 other fields | High correlation |
country_of_birth_mother is highly overall correlated with citizenship and 3 other fields | High correlation |
country_of_birth_self is highly overall correlated with citizenship and 2 other fields | High correlation |
detailed_household_and_family_stat is highly overall correlated with age and 4 other fields | High correlation |
detailed_household_summary_in_household is highly overall correlated with detailed_household_and_family_stat and 3 other fields | High correlation |
detailed_industry_recode is highly overall correlated with class_of_worker and 2 other fields | High correlation |
detailed_occupation_recode is highly overall correlated with class_of_worker and 2 other fields | High correlation |
education is highly overall correlated with tax_filer_stat and 1 other fields | High correlation |
family_members_under_18 is highly overall correlated with detailed_household_and_family_stat and 3 other fields | High correlation |
fill_inc_questionnaire_for_veteran's_admin is highly overall correlated with veterans_benefits | High correlation |
full_or_part_time_employment_stat is highly overall correlated with live_in_this_house_1_year_ago and 2 other fields | High correlation |
hispanic_origin is highly overall correlated with country_of_birth_father and 1 other fields | High correlation |
live_in_this_house_1_year_ago is highly overall correlated with full_or_part_time_employment_stat and 6 other fields | High correlation |
major_industry_code is highly overall correlated with class_of_worker and 3 other fields | High correlation |
major_occupation_code is highly overall correlated with class_of_worker and 3 other fields | High correlation |
marital_stat is highly overall correlated with tax_filer_stat | High correlation |
migration_code_change_in_msa is highly overall correlated with live_in_this_house_1_year_ago and 5 other fields | High correlation |
migration_code_change_in_reg is highly overall correlated with full_or_part_time_employment_stat and 4 other fields | High correlation |
migration_code_move_within_reg is highly overall correlated with live_in_this_house_1_year_ago and 5 other fields | High correlation |
migration_prev_res_in_sunbelt is highly overall correlated with live_in_this_house_1_year_ago and 3 other fields | High correlation |
num_persons_worked_for_employer is highly overall correlated with class_of_worker and 2 other fields | High correlation |
region_of_previous_residence is highly overall correlated with live_in_this_house_1_year_ago and 3 other fields | High correlation |
tax_filer_stat is highly overall correlated with age and 8 other fields | High correlation |
veterans_benefits is highly overall correlated with age and 6 other fields | High correlation |
weeks_worked_in_year is highly overall correlated with num_persons_worked_for_employer and 1 other fields | High correlation |
year is highly overall correlated with full_or_part_time_employment_stat and 4 other fields | High correlation |
enroll_in_edu_inst_last_wk is highly imbalanced (74.3%) | Imbalance |
race is highly imbalanced (61.8%) | Imbalance |
hispanic_origin is highly imbalanced (71.6%) | Imbalance |
member_of_a_labor_union is highly imbalanced (67.5%) | Imbalance |
reason_for_unemployment is highly imbalanced (89.1%) | Imbalance |
region_of_previous_residence is highly imbalanced (78.5%) | Imbalance |
migration_code_move_within_reg is highly imbalanced (54.9%) | Imbalance |
migration_prev_res_in_sunbelt is highly imbalanced (70.5%) | Imbalance |
family_members_under_18 is highly imbalanced (50.3%) | Imbalance |
country_of_birth_father is highly imbalanced (70.6%) | Imbalance |
country_of_birth_mother is highly imbalanced (71.2%) | Imbalance |
country_of_birth_self is highly imbalanced (81.5%) | Imbalance |
citizenship is highly imbalanced (65.2%) | Imbalance |
own_business_or_self_employed is highly imbalanced (67.5%) | Imbalance |
fill_inc_questionnaire_for_veteran's_admin is highly imbalanced (94.3%) | Imbalance |
target is highly imbalanced (66.2%) | Imbalance |
dividends_from_stocks is highly skewed (γ1 = 25.25847346) | Skewed |
age has 1358 (1.4%) zeros | Zeros |
wage_per_hour has 93295 (94.4%) zeros | Zeros |
capital_gains has 95157 (96.2%) zeros | Zeros |
capital_losses has 96971 (98.1%) zeros | Zeros |
dividends_from_stocks has 88351 (89.4%) zeros | Zeros |
num_persons_worked_for_employer has 47009 (47.5%) zeros | Zeros |
weeks_worked_in_year has 47009 (47.5%) zeros | Zeros |
Reproduction
| Analysis started | 2025-01-24 04:12:31.910286 |
|---|---|
| Analysis finished | 2025-01-24 04:13:10.753996 |
| Duration | 38.84 seconds |
| Software version | ydata-profiling vv4.12.1 |
| Download configuration | config.json |
Variables
age
Real number (ℝ)
High correlation  Zeros 
| Distinct | 91 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 34.868668 |
| Minimum | 0 |
|---|---|
| Maximum | 90 |
| Zeros | 1358 |
| Zeros (%) | 1.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 16 |
| median | 33 |
| Q3 | 50 |
| 95-th percentile | 75 |
| Maximum | 90 |
| Range | 90 |
| Interquartile range (IQR) | 34 |
Descriptive statistics
| Standard deviation | 22.275233 |
|---|---|
| Coefficient of variation (CV) | 0.63883234 |
| Kurtosis | -0.73163274 |
| Mean | 34.868668 |
| Median Absolute Deviation (MAD) | 17 |
| Skewness | 0.36351952 |
| Sum | 3447779 |
| Variance | 496.18599 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 33 | 1750 | 1.8% |
| 34 | 1720 | 1.7% |
| 35 | 1670 | 1.7% |
| 37 | 1646 | 1.7% |
| 38 | 1638 | 1.7% |
| 32 | 1628 | 1.6% |
| 3 | 1628 | 1.6% |
| 30 | 1614 | 1.6% |
| 4 | 1612 | 1.6% |
| 31 | 1605 | 1.6% |
| Other values (81) | 82368 |
| Value | Count | Frequency (%) |
| 0 | 1358 | |
| 1 | 1448 | |
| 2 | 1494 | |
| 3 | 1628 | |
| 4 | 1612 | |
| 5 | 1576 | |
| 6 | 1462 | |
| 7 | 1543 | |
| 8 | 1514 | |
| 9 | 1475 |
| Value | Count | Frequency (%) |
| 90 | 373 | |
| 89 | 109 | 0.1% |
| 88 | 141 | 0.1% |
| 87 | 155 | |
| 86 | 162 | |
| 85 | 216 | |
| 84 | 267 | |
| 83 | 276 | |
| 82 | 324 | |
| 81 | 336 |
class_of_worker
Categorical
High correlation 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.7 MiB |
| Not in universe | |
|---|---|
| Private sector | |
| Government | |
| Self-employed | |
| Not employed | 279 |
Length
| Max length | 15 |
|---|---|
| Median length | 14 |
| Mean length | 14.132414 |
| Min length | 10 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Private sector |
|---|---|
| 2nd row | Self-employed |
| 3rd row | Not in universe |
| 4th row | Private sector |
| 5th row | Private sector |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 49199 | |
| Private sector | 36068 | |
| Government | 7405 | 7.5% |
| Self-employed | 5928 | 6.0% |
| Not employed | 279 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not | 49478 | |
| in | 49199 | |
| universe | 49199 | |
| private | 36068 | |
| sector | 36068 | |
| government | 7405 | 3.2% |
| self-employed | 5928 | 2.5% |
| employed | 279 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 203686 | |
| 134745 | ||
| i | 134466 | |
| t | 129019 | |
| r | 128740 | |
| n | 113208 | |
| o | 99158 | |
| v | 92672 | |
| s | 85267 | 6.1% |
| N | 49478 | 3.5% |
| Other values (13) | 226960 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1157847 | |
| Space Separator | 134745 | 9.6% |
| Uppercase Letter | 98879 | 7.1% |
| Dash Punctuation | 5928 | 0.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 203686 | |
| i | 134466 | |
| t | 129019 | |
| r | 128740 | |
| n | 113208 | |
| o | 99158 | |
| v | 92672 | |
| s | 85267 | |
| u | 49199 | 4.2% |
| c | 36068 | 3.1% |
| Other values (7) | 86364 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 49478 | |
| P | 36068 | |
| G | 7405 | 7.5% |
| S | 5928 | 6.0% |
Space Separator
| Value | Count | Frequency (%) |
| 134745 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5928 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1256726 | |
| Common | 140673 | 10.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 203686 | |
| i | 134466 | |
| t | 129019 | |
| r | 128740 | |
| n | 113208 | |
| o | 99158 | |
| v | 92672 | |
| s | 85267 | |
| N | 49478 | 3.9% |
| u | 49199 | 3.9% |
| Other values (11) | 171833 |
Common
| Value | Count | Frequency (%) |
| 134745 | ||
| - | 5928 | 4.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1397399 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 203686 | |
| 134745 | ||
| i | 134466 | |
| t | 129019 | |
| r | 128740 | |
| n | 113208 | |
| o | 99158 | |
| v | 92672 | |
| s | 85267 | 6.1% |
| N | 49478 | 3.5% |
| Other values (13) | 226960 |
detailed_industry_recode
Categorical
High correlation 
| Distinct | 42 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.7 MiB |
| Not in universe or children | |
|---|---|
| Public administration | |
| Manufacturing | 4710 |
| Business and repair services | 3114 |
| Manufacturing-durable goods | 3066 |
| Other values (37) |
Length
| Max length | 58 |
|---|---|
| Median length | 27 |
| Mean length | 24.870377 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Transportation |
|---|---|
| 2nd row | Wholesale and retail trade |
| 3rd row | Not in universe or children |
| 4th row | Business and repair services |
| 5th row | Manufacturing-durable goods |
Common Values
| Value | Count | Frequency (%) |
| Not in universe or children | 49403 | |
| Public administration | 8933 | 9.0% |
| Manufacturing | 4710 | 4.8% |
| Business and repair services | 3114 | 3.1% |
| Manufacturing-durable goods | 3066 | 3.1% |
| Wholesale and retail trade | 2434 | 2.5% |
| Public administration and armed forces | 2304 | 2.3% |
| Trade | 2204 | 2.2% |
| Professional services | 2142 | 2.2% |
| Professional and related services | 1964 | 2.0% |
| Other values (32) | 18605 | 18.8% |
Length
| Value | Count | Frequency (%) |
| not | 50224 | |
| or | 49403 | |
| children | 49403 | |
| in | 49403 | |
| universe | 49403 | |
| services | 13869 | 3.8% |
| and | 13688 | 3.7% |
| public | 11788 | 3.2% |
| administration | 11237 | 3.0% |
| trade | 5335 | 1.4% |
| Other values (45) | 65783 |
Most occurring characters
| Value | Count | Frequency (%) |
| 270657 | ||
| i | 259576 | |
| n | 242723 | |
| e | 238935 | 9.7% |
| r | 235705 | 9.6% |
| o | 148682 | 6.0% |
| s | 136871 | 5.6% |
| t | 114552 | 4.7% |
| a | 105972 | 4.3% |
| u | 103385 | 4.2% |
| Other values (29) | 602100 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2073565 | |
| Space Separator | 270657 | 11.0% |
| Uppercase Letter | 100308 | 4.1% |
| Other Punctuation | 8274 | 0.3% |
| Dash Punctuation | 6354 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 259576 | |
| n | 242723 | |
| e | 238935 | |
| r | 235705 | |
| o | 148682 | 7.2% |
| s | 136871 | 6.6% |
| t | 114552 | 5.5% |
| a | 105972 | 5.1% |
| u | 103385 | 5.0% |
| c | 99350 | 4.8% |
| Other values (11) | 387814 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 50224 | |
| P | 17692 | 17.6% |
| M | 12908 | 12.9% |
| B | 4570 | 4.6% |
| T | 4219 | 4.2% |
| W | 3393 | 3.4% |
| H | 2081 | 2.1% |
| F | 1081 | 1.1% |
| E | 1037 | 1.0% |
| S | 804 | 0.8% |
| Other values (5) | 2299 | 2.3% |
Space Separator
| Value | Count | Frequency (%) |
| 270657 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 8274 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6354 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2173873 | |
| Common | 285285 | 11.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 259576 | |
| n | 242723 | |
| e | 238935 | |
| r | 235705 | |
| o | 148682 | 6.8% |
| s | 136871 | 6.3% |
| t | 114552 | 5.3% |
| a | 105972 | 4.9% |
| u | 103385 | 4.8% |
| c | 99350 | 4.6% |
| Other values (26) | 488122 |
Common
| Value | Count | Frequency (%) |
| 270657 | ||
| , | 8274 | 2.9% |
| - | 6354 | 2.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2459158 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 270657 | ||
| i | 259576 | |
| n | 242723 | |
| e | 238935 | 9.7% |
| r | 235705 | 9.6% |
| o | 148682 | 6.0% |
| s | 136871 | 5.6% |
| t | 114552 | 4.7% |
| a | 105972 | 4.3% |
| u | 103385 | 4.2% |
| Other values (29) | 602100 |
detailed_occupation_recode
Categorical
High correlation 
| Distinct | 47 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.6 MiB |
| Not in universe | |
|---|---|
| Other executive, admin and managerial | 4356 |
| Food service occupations | 3814 |
| Computer equipment operators | 2771 |
| Personal service occupations | 2670 |
| Other values (42) |
Length
| Max length | 46 |
|---|---|
| Median length | 43 |
| Mean length | 23.328037 |
| Min length | 9 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Construction trades |
|---|---|
| 2nd row | Other professional specialty occupations |
| 3rd row | Not in universe |
| 4th row | Management related occupations |
| 5th row | Automobile mechanics and repairers |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 49403 | |
| Other executive, admin and managerial | 4356 | 4.4% |
| Food service occupations | 3814 | 3.9% |
| Computer equipment operators | 2771 | 2.8% |
| Personal service occupations | 2670 | 2.7% |
| Construction trades | 2093 | 2.1% |
| Automobile mechanics and repairers | 2022 | 2.0% |
| Teachers, except college and university | 1806 | 1.8% |
| Supervisors and proprietors, sales occupations | 1742 | 1.8% |
| Forestry and fishing occupations | 1708 | 1.7% |
| Other values (37) | 26494 |
Length
| Value | Count | Frequency (%) |
| not | 49403 | |
| universe | 49403 | |
| in | 49403 | |
| occupations | 24088 | 7.3% |
| and | 23313 | 7.0% |
| other | 10998 | 3.3% |
| service | 9006 | 2.7% |
| operators | 5122 | 1.5% |
| related | 5025 | 1.5% |
| admin | 4632 | 1.4% |
| Other values (83) | 100365 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 252503 | |
| 231879 | ||
| i | 210919 | 9.1% |
| n | 206377 | 8.9% |
| o | 166383 | 7.2% |
| t | 159647 | 6.9% |
| r | 159308 | 6.9% |
| s | 155618 | 6.7% |
| a | 129532 | 5.6% |
| u | 101707 | 4.4% |
| Other values (30) | 532780 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1965593 | |
| Space Separator | 231879 | 10.1% |
| Uppercase Letter | 98879 | 4.3% |
| Other Punctuation | 10302 | 0.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 252503 | |
| i | 210919 | |
| n | 206377 | |
| o | 166383 | |
| t | 159647 | |
| r | 159308 | |
| s | 155618 | |
| a | 129532 | 6.6% |
| u | 101707 | 5.2% |
| c | 97524 | 5.0% |
| Other values (15) | 326075 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 49603 | |
| O | 11274 | 11.4% |
| F | 8529 | 8.6% |
| C | 6430 | 6.5% |
| P | 6343 | 6.4% |
| M | 3725 | 3.8% |
| S | 2676 | 2.7% |
| H | 2422 | 2.4% |
| E | 2192 | 2.2% |
| T | 2177 | 2.2% |
| Other values (3) | 3508 | 3.5% |
Space Separator
| Value | Count | Frequency (%) |
| 231879 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 10302 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2064472 | |
| Common | 242181 | 10.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 252503 | |
| i | 210919 | |
| n | 206377 | |
| o | 166383 | 8.1% |
| t | 159647 | 7.7% |
| r | 159308 | 7.7% |
| s | 155618 | 7.5% |
| a | 129532 | 6.3% |
| u | 101707 | 4.9% |
| c | 97524 | 4.7% |
| Other values (28) | 424954 |
Common
| Value | Count | Frequency (%) |
| 231879 | ||
| , | 10302 | 4.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2306653 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 252503 | |
| 231879 | ||
| i | 210919 | 9.1% |
| n | 206377 | 8.9% |
| o | 166383 | 7.2% |
| t | 159647 | 6.9% |
| r | 159308 | 6.9% |
| s | 155618 | 6.7% |
| a | 129532 | 5.6% |
| u | 101707 | 4.4% |
| Other values (30) | 532780 |
education
Categorical
High correlation 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.7 MiB |
| High School Graduate | |
|---|---|
| Children | |
| Below High School | |
| Some College | |
| College Graduate |
Length
| Max length | 20 |
|---|---|
| Median length | 16 |
| Mean length | 14.531731 |
| Min length | 8 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Below High School |
|---|---|
| 2nd row | Some College |
| 3rd row | Children |
| 4th row | High School Graduate |
| 5th row | High School Graduate |
Common Values
| Value | Count | Frequency (%) |
| High School Graduate | 24141 | |
| Children | 22600 | |
| Below High School | 18733 | |
| Some College | 18719 | |
| College Graduate | 9884 | |
| Advanced Degree | 4802 | 4.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| high | 42874 | |
| school | 42874 | |
| graduate | 34025 | |
| college | 28603 | |
| children | 22600 | |
| below | 18733 | |
| some | 18719 | |
| advanced | 4802 | 2.2% |
| degree | 4802 | 2.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 170491 | |
| o | 151803 | 10.6% |
| l | 141413 | 9.8% |
| 119153 | 8.3% | |
| h | 108348 | 7.5% |
| g | 76279 | 5.3% |
| a | 72852 | 5.1% |
| d | 66229 | 4.6% |
| i | 65474 | 4.6% |
| S | 61593 | 4.3% |
| Other values (14) | 403248 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1099698 | |
| Uppercase Letter | 218032 | 15.2% |
| Space Separator | 119153 | 8.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 170491 | |
| o | 151803 | |
| l | 141413 | |
| h | 108348 | |
| g | 76279 | |
| a | 72852 | |
| d | 66229 | 6.0% |
| i | 65474 | 6.0% |
| r | 61427 | 5.6% |
| c | 47676 | 4.3% |
| Other values (6) | 137706 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 61593 | |
| C | 51203 | |
| H | 42874 | |
| G | 34025 | |
| B | 18733 | 8.6% |
| A | 4802 | 2.2% |
| D | 4802 | 2.2% |
Space Separator
| Value | Count | Frequency (%) |
| 119153 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1317730 | |
| Common | 119153 | 8.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 170491 | |
| o | 151803 | |
| l | 141413 | |
| h | 108348 | 8.2% |
| g | 76279 | 5.8% |
| a | 72852 | 5.5% |
| d | 66229 | 5.0% |
| i | 65474 | 5.0% |
| S | 61593 | 4.7% |
| r | 61427 | 4.7% |
| Other values (13) | 341821 |
Common
| Value | Count | Frequency (%) |
| 119153 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1436883 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 170491 | |
| o | 151803 | 10.6% |
| l | 141413 | 9.8% |
| 119153 | 8.3% | |
| h | 108348 | 7.5% |
| g | 76279 | 5.3% |
| a | 72852 | 5.1% |
| d | 66229 | 4.6% |
| i | 65474 | 4.6% |
| S | 61593 | 4.3% |
| Other values (14) | 403248 |
wage_per_hour
Real number (ℝ)
Zeros 
| Distinct | 894 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 54.947613 |
| Minimum | 0 |
|---|---|
| Maximum | 9900 |
| Zeros | 93295 |
| Zeros (%) | 94.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 500 |
| Maximum | 9900 |
| Range | 9900 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 271.35721 |
|---|---|
| Coefficient of variation (CV) | 4.9384713 |
| Kurtosis | 148.26554 |
| Mean | 54.947613 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 8.7180739 |
| Sum | 5433165 |
| Variance | 73634.733 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 93295 | |
| 500 | 360 | 0.4% |
| 600 | 285 | 0.3% |
| 700 | 281 | 0.3% |
| 800 | 249 | 0.3% |
| 1000 | 213 | 0.2% |
| 425 | 178 | 0.2% |
| 900 | 154 | 0.2% |
| 550 | 143 | 0.1% |
| 1100 | 125 | 0.1% |
| Other values (884) | 3596 | 3.6% |
| Value | Count | Frequency (%) |
| 0 | 93295 | |
| 100 | 2 | < 0.1% |
| 150 | 3 | < 0.1% |
| 178 | 1 | < 0.1% |
| 200 | 12 | < 0.1% |
| 205 | 1 | < 0.1% |
| 208 | 1 | < 0.1% |
| 209 | 1 | < 0.1% |
| 210 | 3 | < 0.1% |
| 211 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 9900 | 2 | |
| 8831 | 1 | |
| 8800 | 1 | |
| 8000 | 2 | |
| 7700 | 1 | |
| 7500 | 1 | |
| 7400 | 1 | |
| 7000 | 2 | |
| 6500 | 2 | |
| 6000 | 1 |
enroll_in_edu_inst_last_wk
Categorical
Imbalance 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.9 MiB |
| Not in universe | |
|---|---|
| High school | 3499 |
| College or university | 2830 |
Length
| Max length | 22 |
|---|---|
| Median length | 16 |
| Mean length | 16.030178 |
| Min length | 12 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Not in universe |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 92550 | |
| High school | 3499 | 3.5% |
| College or university | 2830 | 2.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not | 92550 | |
| in | 92550 | |
| universe | 92550 | |
| high | 3499 | 1.2% |
| school | 3499 | 1.2% |
| college | 2830 | 1.0% |
| or | 2830 | 1.0% |
| university | 2830 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 293138 | ||
| i | 194259 | |
| e | 193590 | |
| n | 187930 | |
| o | 105208 | 6.6% |
| s | 98879 | 6.2% |
| r | 98210 | 6.2% |
| v | 95380 | 6.0% |
| u | 95380 | 6.0% |
| t | 95380 | 6.0% |
| Other values (8) | 127694 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1193031 | |
| Space Separator | 293138 | 18.5% |
| Uppercase Letter | 98879 | 6.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 194259 | |
| e | 193590 | |
| n | 187930 | |
| o | 105208 | |
| s | 98879 | |
| r | 98210 | |
| v | 95380 | |
| u | 95380 | |
| t | 95380 | |
| l | 9159 | 0.8% |
| Other values (4) | 19656 | 1.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 92550 | |
| H | 3499 | 3.5% |
| C | 2830 | 2.9% |
Space Separator
| Value | Count | Frequency (%) |
| 293138 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1291910 | |
| Common | 293138 | 18.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 194259 | |
| e | 193590 | |
| n | 187930 | |
| o | 105208 | |
| s | 98879 | |
| r | 98210 | |
| v | 95380 | |
| u | 95380 | |
| t | 95380 | |
| N | 92550 | |
| Other values (7) | 35144 | 2.7% |
Common
| Value | Count | Frequency (%) |
| 293138 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1585048 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 293138 | ||
| i | 194259 | |
| e | 193590 | |
| n | 187930 | |
| o | 105208 | 6.6% |
| s | 98879 | 6.2% |
| r | 98210 | 6.2% |
| v | 95380 | 6.0% |
| u | 95380 | 6.0% |
| t | 95380 | 6.0% |
| Other values (8) | 127694 |
marital_stat
Categorical
High correlation 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.3 MiB |
| Married | |
|---|---|
| Never Married | |
| Divorced | |
| Widowed | |
| Separated | 1696 |
Length
| Max length | 21 |
|---|---|
| Median length | 13 |
| Mean length | 9.7658451 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Married |
|---|---|
| 2nd row | Married |
| 3rd row | Never Married |
| 4th row | Divorced |
| 5th row | Divorced |
Common Values
| Value | Count | Frequency (%) |
| Married | 42422 | |
| Never Married | 42272 | |
| Divorced | 6450 | 6.5% |
| Widowed | 5324 | 5.4% |
| Separated | 1696 | 1.7% |
| Married-spouse absent | 715 | 0.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| married | 84694 | |
| never | 42272 | |
| divorced | 6450 | 4.5% |
| widowed | 5324 | 3.8% |
| separated | 1696 | 1.2% |
| married-spouse | 715 | 0.5% |
| absent | 715 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 221236 | |
| e | 186549 | |
| d | 104203 | |
| i | 97183 | |
| a | 89516 | |
| M | 85409 | 8.8% |
| v | 48722 | 5.0% |
| 42987 | 4.5% | |
| N | 42272 | 4.4% |
| o | 12489 | 1.3% |
| Other values (12) | 35071 | 3.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 780784 | |
| Uppercase Letter | 141151 | 14.6% |
| Space Separator | 42987 | 4.5% |
| Dash Punctuation | 715 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 221236 | |
| e | 186549 | |
| d | 104203 | |
| i | 97183 | |
| a | 89516 | |
| v | 48722 | 6.2% |
| o | 12489 | 1.6% |
| c | 6450 | 0.8% |
| w | 5324 | 0.7% |
| p | 2411 | 0.3% |
| Other values (5) | 6701 | 0.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 85409 | |
| N | 42272 | |
| D | 6450 | 4.6% |
| W | 5324 | 3.8% |
| S | 1696 | 1.2% |
Space Separator
| Value | Count | Frequency (%) |
| 42987 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 715 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 921935 | |
| Common | 43702 | 4.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 221236 | |
| e | 186549 | |
| d | 104203 | |
| i | 97183 | |
| a | 89516 | |
| M | 85409 | 9.3% |
| v | 48722 | 5.3% |
| N | 42272 | 4.6% |
| o | 12489 | 1.4% |
| c | 6450 | 0.7% |
| Other values (10) | 27906 | 3.0% |
Common
| Value | Count | Frequency (%) |
| 42987 | ||
| - | 715 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 965637 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 221236 | |
| e | 186549 | |
| d | 104203 | |
| i | 97183 | |
| a | 89516 | |
| M | 85409 | 8.8% |
| v | 48722 | 5.0% |
| 42987 | 4.5% | |
| N | 42272 | 4.4% |
| o | 12489 | 1.3% |
| Other values (12) | 35071 | 3.6% |
major_industry_code
Categorical
High correlation 
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.7 MiB |
| Not in universe or children | |
|---|---|
| Retail trade | |
| Manufacturing-durable goods | 4445 |
| Education | 4227 |
| Manufacturing-nondurable goods | 3394 |
| Other values (19) |
Length
| Max length | 36 |
|---|---|
| Median length | 28 |
| Mean length | 24.316478 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Manufacturing-durable goods |
|---|---|
| 2nd row | Business and repair services |
| 3rd row | Not in universe or children |
| 4th row | Transportation |
| 5th row | Construction |
Common Values
| Value | Count | Frequency (%) |
| Not in universe or children | 49403 | |
| Retail trade | 8711 | 8.8% |
| Manufacturing-durable goods | 4445 | 4.5% |
| Education | 4227 | 4.3% |
| Manufacturing-nondurable goods | 3394 | 3.4% |
| Construction | 3066 | 3.1% |
| Finance insurance and real estate | 3019 | 3.1% |
| Business and repair services | 2985 | 3.0% |
| Medical except hospital | 2304 | 2.3% |
| Transportation | 2211 | 2.2% |
| Other values (14) | 15114 | 15.3% |
Length
| Value | Count | Frequency (%) |
| not | 49403 | |
| universe | 49403 | |
| or | 49403 | |
| children | 49403 | |
| in | 49403 | |
| services | 10782 | 3.0% |
| trade | 10533 | 2.9% |
| retail | 8711 | 2.4% |
| goods | 7839 | 2.2% |
| and | 6676 | 1.9% |
| Other values (34) | 67347 |
Most occurring characters
| Value | Count | Frequency (%) |
| 358903 | ||
| e | 243705 | |
| i | 224203 | 9.3% |
| n | 220082 | 9.2% |
| r | 219335 | 9.1% |
| o | 150196 | 6.2% |
| t | 120058 | 5.0% |
| s | 115646 | 4.8% |
| a | 95172 | 4.0% |
| c | 92888 | 3.9% |
| Other values (28) | 564201 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1935840 | |
| Space Separator | 358903 | 14.9% |
| Uppercase Letter | 101807 | 4.2% |
| Dash Punctuation | 7839 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 243705 | |
| i | 224203 | |
| n | 220082 | |
| r | 219335 | |
| o | 150196 | |
| t | 120058 | 6.2% |
| s | 115646 | 6.0% |
| a | 95172 | 4.9% |
| c | 92888 | 4.8% |
| u | 92400 | 4.8% |
| Other values (11) | 362155 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 49403 | |
| M | 10468 | 10.3% |
| R | 8711 | 8.6% |
| E | 5048 | 5.0% |
| H | 4763 | 4.7% |
| P | 4128 | 4.1% |
| C | 3661 | 3.6% |
| F | 3137 | 3.1% |
| B | 2985 | 2.9% |
| T | 2211 | 2.2% |
| Other values (5) | 7292 | 7.2% |
Space Separator
| Value | Count | Frequency (%) |
| 358903 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 7839 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2037647 | |
| Common | 366742 | 15.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 243705 | |
| i | 224203 | |
| n | 220082 | |
| r | 219335 | |
| o | 150196 | 7.4% |
| t | 120058 | 5.9% |
| s | 115646 | 5.7% |
| a | 95172 | 4.7% |
| c | 92888 | 4.6% |
| u | 92400 | 4.5% |
| Other values (26) | 463962 |
Common
| Value | Count | Frequency (%) |
| 358903 | ||
| - | 7839 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2404389 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 358903 | ||
| e | 243705 | |
| i | 224203 | 9.3% |
| n | 220082 | 9.2% |
| r | 219335 | 9.1% |
| o | 150196 | 6.2% |
| t | 120058 | 5.0% |
| s | 115646 | 4.8% |
| a | 95172 | 4.0% |
| c | 92888 | 3.9% |
| Other values (28) | 564201 |
major_occupation_code
Categorical
High correlation 
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.3 MiB |
| Not in universe | |
|---|---|
| Adm support including clerical | |
| Professional specialty | |
| Executive admin and managerial | |
| Other service | |
| Other values (10) |
Length
| Max length | 38 |
|---|---|
| Median length | 36 |
| Mean length | 20.779073 |
| Min length | 6 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Machine operators assmblrs & inspctrs |
|---|---|
| 2nd row | Professional specialty |
| 3rd row | Not in universe |
| 4th row | Executive admin and managerial |
| 5th row | Precision production craft & repair |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 49403 | |
| Adm support including clerical | 7252 | 7.3% |
| Professional specialty | 6869 | 6.9% |
| Executive admin and managerial | 6288 | 6.4% |
| Other service | 6176 | 6.2% |
| Sales | 6021 | 6.1% |
| Precision production craft & repair | 5354 | 5.4% |
| Machine operators assmblrs & inspctrs | 3186 | 3.2% |
| Handlers equip cleaners etc | 2070 | 2.1% |
| Transportation and material moving | 2040 | 2.1% |
| Other values (5) | 4220 | 4.3% |
Length
| Value | Count | Frequency (%) |
| not | 49403 | |
| in | 49403 | |
| universe | 49403 | |
| and | 11320 | 3.7% |
| support | 8724 | 2.8% |
| 8540 | 2.8% | |
| clerical | 7252 | 2.4% |
| adm | 7252 | 2.4% |
| including | 7252 | 2.4% |
| professional | 6869 | 2.2% |
| Other values (33) | 103051 |
Most occurring characters
| Value | Count | Frequency (%) |
| 310539 | ||
| i | 205246 | |
| e | 203495 | |
| n | 177399 | 8.6% |
| r | 149150 | 7.3% |
| s | 128958 | 6.3% |
| t | 107721 | 5.2% |
| o | 103592 | 5.0% |
| a | 100938 | 4.9% |
| u | 79516 | 3.9% |
| Other values (24) | 488060 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1636640 | |
| Space Separator | 310539 | 15.1% |
| Uppercase Letter | 98895 | 4.8% |
| Other Punctuation | 8540 | 0.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 205246 | |
| e | 203495 | |
| n | 177399 | |
| r | 149150 | |
| s | 128958 | |
| t | 107721 | 6.6% |
| o | 103592 | 6.3% |
| a | 100938 | 6.2% |
| u | 79516 | 4.9% |
| c | 72622 | 4.4% |
| Other values (12) | 308003 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 49403 | |
| P | 13435 | 13.6% |
| A | 7268 | 7.3% |
| E | 6288 | 6.4% |
| O | 6176 | 6.2% |
| S | 6021 | 6.1% |
| T | 3512 | 3.6% |
| M | 3186 | 3.2% |
| H | 2070 | 2.1% |
| F | 1536 | 1.6% |
Space Separator
| Value | Count | Frequency (%) |
| 310539 |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 8540 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1735535 | |
| Common | 319079 | 15.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 205246 | |
| e | 203495 | |
| n | 177399 | |
| r | 149150 | 8.6% |
| s | 128958 | 7.4% |
| t | 107721 | 6.2% |
| o | 103592 | 6.0% |
| a | 100938 | 5.8% |
| u | 79516 | 4.6% |
| c | 72622 | 4.2% |
| Other values (22) | 406898 |
Common
| Value | Count | Frequency (%) |
| 310539 | ||
| & | 8540 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2054614 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 310539 | ||
| i | 205246 | |
| e | 203495 | |
| n | 177399 | 8.6% |
| r | 149150 | 7.3% |
| s | 128958 | 6.3% |
| t | 107721 | 5.2% |
| o | 103592 | 5.0% |
| a | 100938 | 4.9% |
| u | 79516 | 3.9% |
| Other values (24) | 488060 |
race
Categorical
Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.0 MiB |
| White | |
|---|---|
| Black | |
| Asian or Pacific Islander | 2909 |
| Other | 1899 |
| Amer Indian Aleut or Eskimo | 1205 |
Length
| Max length | 28 |
|---|---|
| Median length | 6 |
| Mean length | 6.8565014 |
| Min length | 6 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | White |
|---|---|
| 2nd row | White |
| 3rd row | White |
| 4th row | White |
| 5th row | White |
Common Values
| Value | Count | Frequency (%) |
| White | 82805 | |
| Black | 10061 | 10.2% |
| Asian or Pacific Islander | 2909 | 2.9% |
| Other | 1899 | 1.9% |
| Amer Indian Aleut or Eskimo | 1205 | 1.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| white | 82805 | |
| black | 10061 | 8.9% |
| or | 4114 | 3.7% |
| asian | 2909 | 2.6% |
| pacific | 2909 | 2.6% |
| islander | 2909 | 2.6% |
| other | 1899 | 1.7% |
| amer | 1205 | 1.1% |
| indian | 1205 | 1.1% |
| aleut | 1205 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 112426 | ||
| i | 93942 | |
| e | 90023 | |
| t | 85909 | |
| h | 84704 | |
| W | 82805 | |
| a | 19993 | 2.9% |
| c | 15879 | 2.3% |
| l | 14175 | 2.1% |
| k | 11266 | 1.7% |
| Other values (14) | 66842 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 457226 | |
| Space Separator | 112426 | 16.6% |
| Uppercase Letter | 108312 | 16.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 93942 | |
| e | 90023 | |
| t | 85909 | |
| h | 84704 | |
| a | 19993 | 4.4% |
| c | 15879 | 3.5% |
| l | 14175 | 3.1% |
| k | 11266 | 2.5% |
| r | 10127 | 2.2% |
| n | 8228 | 1.8% |
| Other values (6) | 22980 | 5.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 82805 | |
| B | 10061 | 9.3% |
| A | 5319 | 4.9% |
| I | 4114 | 3.8% |
| P | 2909 | 2.7% |
| O | 1899 | 1.8% |
| E | 1205 | 1.1% |
Space Separator
| Value | Count | Frequency (%) |
| 112426 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 565538 | |
| Common | 112426 | 16.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 93942 | |
| e | 90023 | |
| t | 85909 | |
| h | 84704 | |
| W | 82805 | |
| a | 19993 | 3.5% |
| c | 15879 | 2.8% |
| l | 14175 | 2.5% |
| k | 11266 | 2.0% |
| r | 10127 | 1.8% |
| Other values (13) | 56715 |
Common
| Value | Count | Frequency (%) |
| 112426 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 677964 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 112426 | ||
| i | 93942 | |
| e | 90023 | |
| t | 85909 | |
| h | 84704 | |
| W | 82805 | |
| a | 19993 | 2.9% |
| c | 15879 | 2.3% |
| l | 14175 | 2.1% |
| k | 11266 | 1.7% |
| Other values (14) | 66842 |
hispanic_origin
Categorical
High correlation  Imbalance 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.4 MiB |
| All other | |
|---|---|
| Mexican-American | 3981 |
| Mexican (Mexicano) | 3686 |
| Central or South American | 1985 |
| Puerto Rican | 1578 |
| Other values (5) | 2565 |
Length
| Max length | 26 |
|---|---|
| Median length | 10 |
| Mean length | 10.982989 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Mexican (Mexicano) |
|---|---|
| 2nd row | All other |
| 3rd row | Mexican-American |
| 4th row | All other |
| 5th row | All other |
Common Values
| Value | Count | Frequency (%) |
| All other | 85084 | |
| Mexican-American | 3981 | 4.0% |
| Mexican (Mexicano) | 3686 | 3.7% |
| Central or South American | 1985 | 2.0% |
| Puerto Rican | 1578 | 1.6% |
| Other Spanish | 1243 | 1.3% |
| Cuban | 613 | 0.6% |
| NA | 400 | 0.4% |
| Chicano | 169 | 0.2% |
| Do not know | 140 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| other | 86327 | |
| all | 85084 | |
| mexican-american | 3981 | 2.0% |
| mexican | 3686 | 1.9% |
| mexicano | 3686 | 1.9% |
| central | 1985 | 1.0% |
| or | 1985 | 1.0% |
| south | 1985 | 1.0% |
| american | 1985 | 1.0% |
| rican | 1578 | 0.8% |
| Other values (8) | 4423 | 2.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 196705 | ||
| l | 172153 | |
| e | 107209 | |
| r | 97841 | |
| o | 94907 | |
| t | 92015 | |
| A | 91450 | |
| h | 89724 | |
| n | 23187 | 2.1% |
| a | 22907 | 2.1% |
| Other values (21) | 97889 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 764192 | |
| Space Separator | 196705 | 18.1% |
| Uppercase Letter | 113737 | 10.5% |
| Dash Punctuation | 3981 | 0.4% |
| Open Punctuation | 3686 | 0.3% |
| Close Punctuation | 3686 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 172153 | |
| e | 107209 | |
| r | 97841 | |
| o | 94907 | |
| t | 92015 | |
| h | 89724 | |
| n | 23187 | 3.0% |
| a | 22907 | 3.0% |
| i | 20309 | 2.7% |
| c | 19066 | 2.5% |
| Other values (8) | 24874 | 3.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 91450 | |
| M | 11353 | 10.0% |
| S | 3228 | 2.8% |
| C | 2767 | 2.4% |
| P | 1578 | 1.4% |
| R | 1578 | 1.4% |
| O | 1243 | 1.1% |
| N | 400 | 0.4% |
| D | 140 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 196705 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3981 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3686 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3686 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 877929 | |
| Common | 208058 | 19.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 172153 | |
| e | 107209 | |
| r | 97841 | |
| o | 94907 | |
| t | 92015 | |
| A | 91450 | |
| h | 89724 | |
| n | 23187 | 2.6% |
| a | 22907 | 2.6% |
| i | 20309 | 2.3% |
| Other values (17) | 66227 | 7.5% |
Common
| Value | Count | Frequency (%) |
| 196705 | ||
| - | 3981 | 1.9% |
| ( | 3686 | 1.8% |
| ) | 3686 | 1.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1085987 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 196705 | ||
| l | 172153 | |
| e | 107209 | |
| r | 97841 | |
| o | 94907 | |
| t | 92015 | |
| A | 91450 | |
| h | 89724 | |
| n | 23187 | 2.1% |
| a | 22907 | 2.1% |
| Other values (21) | 97889 |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 6.0388657 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Female |
|---|---|
| 2nd row | Female |
| 3rd row | Male |
| 4th row | Female |
| 5th row | Male |
Common Values
| Value | Count | Frequency (%) |
| Female | 51361 | |
| Male | 47518 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| female | 51361 | |
| male | 47518 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 150240 | |
| 98879 | ||
| a | 98879 | |
| l | 98879 | |
| F | 51361 | 8.6% |
| m | 51361 | 8.6% |
| M | 47518 | 8.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 399359 | |
| Space Separator | 98879 | 16.6% |
| Uppercase Letter | 98879 | 16.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 150240 | |
| a | 98879 | |
| l | 98879 | |
| m | 51361 | 12.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 51361 | |
| M | 47518 |
Space Separator
| Value | Count | Frequency (%) |
| 98879 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 498238 | |
| Common | 98879 | 16.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 150240 | |
| a | 98879 | |
| l | 98879 | |
| F | 51361 | 10.3% |
| m | 51361 | 10.3% |
| M | 47518 | 9.5% |
Common
| Value | Count | Frequency (%) |
| 98879 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 597117 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 150240 | |
| 98879 | ||
| a | 98879 | |
| l | 98879 | |
| F | 51361 | 8.6% |
| m | 51361 | 8.6% |
| M | 47518 | 8.0% |
member_of_a_labor_union
Categorical
Imbalance 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.8 MiB |
| Not in universe | |
|---|---|
| No | 8034 |
| Yes | 1445 |
Length
| Max length | 16 |
|---|---|
| Median length | 16 |
| Mean length | 14.768373 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Not in universe |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 89400 | |
| No | 8034 | 8.1% |
| Yes | 1445 | 1.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not | 89400 | |
| in | 89400 | |
| universe | 89400 | |
| no | 8034 | 2.9% |
| yes | 1445 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 277679 | ||
| e | 180245 | |
| i | 178800 | |
| n | 178800 | |
| N | 97434 | 6.7% |
| o | 97434 | 6.7% |
| s | 90845 | 6.2% |
| t | 89400 | 6.1% |
| u | 89400 | 6.1% |
| v | 89400 | 6.1% |
| Other values (2) | 90845 | 6.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1083724 | |
| Space Separator | 277679 | 19.0% |
| Uppercase Letter | 98879 | 6.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 180245 | |
| i | 178800 | |
| n | 178800 | |
| o | 97434 | |
| s | 90845 | |
| t | 89400 | |
| u | 89400 | |
| v | 89400 | |
| r | 89400 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 97434 | |
| Y | 1445 | 1.5% |
Space Separator
| Value | Count | Frequency (%) |
| 277679 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1182603 | |
| Common | 277679 | 19.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 180245 | |
| i | 178800 | |
| n | 178800 | |
| N | 97434 | |
| o | 97434 | |
| s | 90845 | |
| t | 89400 | |
| u | 89400 | |
| v | 89400 | |
| r | 89400 |
Common
| Value | Count | Frequency (%) |
| 277679 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1460282 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 277679 | ||
| e | 180245 | |
| i | 178800 | |
| n | 178800 | |
| N | 97434 | 6.7% |
| o | 97434 | 6.7% |
| s | 90845 | 6.2% |
| t | 89400 | 6.1% |
| u | 89400 | 6.1% |
| v | 89400 | 6.1% |
| Other values (2) | 90845 | 6.2% |
reason_for_unemployment
Categorical
Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.8 MiB |
| Not in universe | |
|---|---|
| Job loser | 1616 |
| Re-entrant | 1024 |
| Job leaver | 286 |
| New entrant | 204 |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 14.827446 |
| Min length | 9 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Not in universe |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 95749 | |
| Job loser | 1616 | 1.6% |
| Re-entrant | 1024 | 1.0% |
| Job leaver | 286 | 0.3% |
| New entrant | 204 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not | 95749 | |
| in | 95749 | |
| universe | 95749 | |
| job | 1902 | 0.7% |
| loser | 1616 | 0.6% |
| re-entrant | 1024 | 0.4% |
| leaver | 286 | 0.1% |
| new | 204 | 0.1% |
| entrant | 204 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 196142 | |
| n | 193954 | |
| 193604 | ||
| i | 191498 | |
| o | 99267 | |
| r | 98879 | |
| t | 98205 | |
| s | 97365 | |
| v | 96035 | |
| N | 95953 | |
| Other values (8) | 105221 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1172616 | |
| Space Separator | 193604 | 13.2% |
| Uppercase Letter | 98879 | 6.7% |
| Dash Punctuation | 1024 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 196142 | |
| n | 193954 | |
| i | 191498 | |
| o | 99267 | |
| r | 98879 | |
| t | 98205 | |
| s | 97365 | |
| v | 96035 | |
| u | 95749 | |
| b | 1902 | 0.2% |
| Other values (3) | 3620 | 0.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 95953 | |
| J | 1902 | 1.9% |
| R | 1024 | 1.0% |
Space Separator
| Value | Count | Frequency (%) |
| 193604 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1024 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1271495 | |
| Common | 194628 | 13.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 196142 | |
| n | 193954 | |
| i | 191498 | |
| o | 99267 | |
| r | 98879 | |
| t | 98205 | |
| s | 97365 | |
| v | 96035 | |
| N | 95953 | |
| u | 95749 | |
| Other values (6) | 8448 | 0.7% |
Common
| Value | Count | Frequency (%) |
| 193604 | ||
| - | 1024 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1466123 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 196142 | |
| n | 193954 | |
| 193604 | ||
| i | 191498 | |
| o | 99267 | |
| r | 98879 | |
| t | 98205 | |
| s | 97365 | |
| v | 96035 | |
| N | 95953 | |
| Other values (8) | 105221 |
full_or_part_time_employment_stat
Categorical
High correlation 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.0 MiB |
| Children or Armed Forces | |
|---|---|
| FTE | |
| Not Employed | |
| PTE | 2987 |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 17.13831 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | FTE |
|---|---|
| 2nd row | PTE |
| 3rd row | Children or Armed Forces |
| 4th row | Children or Armed Forces |
| 5th row | FTE |
Common Values
| Value | Count | Frequency (%) |
| Children or Armed Forces | 60832 | |
| FTE | 21670 | 21.9% |
| Not Employed | 13390 | 13.5% |
| PTE | 2987 | 3.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| children | 60832 | |
| or | 60832 | |
| armed | 60832 | |
| forces | 60832 | |
| fte | 21670 | 7.4% |
| not | 13390 | 4.5% |
| employed | 13390 | 4.5% |
| pte | 2987 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 243328 | |
| e | 195886 | |
| 195886 | ||
| o | 148444 | 8.8% |
| d | 135054 | 8.0% |
| F | 82502 | 4.9% |
| m | 74222 | 4.4% |
| l | 74222 | 4.4% |
| h | 60832 | 3.6% |
| s | 60832 | 3.6% |
| Other values (12) | 423411 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1215486 | |
| Uppercase Letter | 283247 | 16.7% |
| Space Separator | 195886 | 11.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 243328 | |
| e | 195886 | |
| o | 148444 | |
| d | 135054 | |
| m | 74222 | 6.1% |
| l | 74222 | 6.1% |
| h | 60832 | 5.0% |
| s | 60832 | 5.0% |
| c | 60832 | 5.0% |
| n | 60832 | 5.0% |
| Other values (4) | 101002 |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 82502 | |
| C | 60832 | |
| A | 60832 | |
| E | 38047 | |
| T | 24657 | 8.7% |
| N | 13390 | 4.7% |
| P | 2987 | 1.1% |
Space Separator
| Value | Count | Frequency (%) |
| 195886 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1498733 | |
| Common | 195886 | 11.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 243328 | |
| e | 195886 | |
| o | 148444 | |
| d | 135054 | 9.0% |
| F | 82502 | 5.5% |
| m | 74222 | 5.0% |
| l | 74222 | 5.0% |
| h | 60832 | 4.1% |
| s | 60832 | 4.1% |
| c | 60832 | 4.1% |
| Other values (11) | 362579 |
Common
| Value | Count | Frequency (%) |
| 195886 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1694619 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 243328 | |
| e | 195886 | |
| 195886 | ||
| o | 148444 | 8.8% |
| d | 135054 | 8.0% |
| F | 82502 | 4.9% |
| m | 74222 | 4.4% |
| l | 74222 | 4.4% |
| h | 60832 | 3.6% |
| s | 60832 | 3.6% |
| Other values (12) | 423411 |
capital_gains
Real number (ℝ)
Zeros 
| Distinct | 123 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 429.59091 |
| Minimum | 0 |
|---|---|
| Maximum | 99999 |
| Zeros | 95157 |
| Zeros (%) | 96.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 99999 |
| Range | 99999 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 4637.1881 |
|---|---|
| Coefficient of variation (CV) | 10.794428 |
| Kurtosis | 402.82376 |
| Mean | 429.59091 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 19.209214 |
| Sum | 42477520 |
| Variance | 21503513 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 95157 | |
| 15024 | 380 | 0.4% |
| 7298 | 289 | 0.3% |
| 7688 | 285 | 0.3% |
| 99999 | 188 | 0.2% |
| 5178 | 106 | 0.1% |
| 4386 | 103 | 0.1% |
| 3103 | 101 | 0.1% |
| 5013 | 84 | 0.1% |
| 10520 | 71 | 0.1% |
| Other values (113) | 2115 | 2.1% |
| Value | Count | Frequency (%) |
| 0 | 95157 | |
| 114 | 9 | < 0.1% |
| 401 | 22 | < 0.1% |
| 594 | 45 | < 0.1% |
| 914 | 7 | < 0.1% |
| 991 | 27 | < 0.1% |
| 1055 | 43 | < 0.1% |
| 1086 | 61 | 0.1% |
| 1111 | 8 | < 0.1% |
| 1140 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 99999 | 188 | |
| 41310 | 3 | < 0.1% |
| 34095 | 2 | < 0.1% |
| 27828 | 44 | < 0.1% |
| 25236 | 9 | < 0.1% |
| 25124 | 13 | < 0.1% |
| 20051 | 36 | < 0.1% |
| 18481 | 8 | < 0.1% |
| 15831 | 8 | < 0.1% |
| 15024 | 380 |
capital_losses
Real number (ℝ)
Zeros 
| Distinct | 111 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 36.240223 |
| Minimum | 0 |
|---|---|
| Maximum | 4608 |
| Zeros | 96971 |
| Zeros (%) | 98.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 4608 |
| Range | 4608 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 266.68642 |
|---|---|
| Coefficient of variation (CV) | 7.3588515 |
| Kurtosis | 64.450436 |
| Mean | 36.240223 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.7596025 |
| Sum | 3583397 |
| Variance | 71121.646 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 96971 | |
| 1902 | 213 | 0.2% |
| 1977 | 195 | 0.2% |
| 1887 | 168 | 0.2% |
| 1602 | 106 | 0.1% |
| 1485 | 59 | 0.1% |
| 1848 | 54 | 0.1% |
| 1740 | 52 | 0.1% |
| 2415 | 45 | < 0.1% |
| 1590 | 40 | < 0.1% |
| Other values (101) | 976 | 1.0% |
| Value | Count | Frequency (%) |
| 0 | 96971 | |
| 155 | 2 | < 0.1% |
| 213 | 6 | < 0.1% |
| 323 | 2 | < 0.1% |
| 419 | 18 | < 0.1% |
| 625 | 11 | < 0.1% |
| 653 | 5 | < 0.1% |
| 772 | 3 | < 0.1% |
| 810 | 5 | < 0.1% |
| 880 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 4608 | 4 | < 0.1% |
| 4356 | 15 | |
| 3900 | 1 | < 0.1% |
| 3770 | 3 | < 0.1% |
| 3683 | 2 | < 0.1% |
| 3500 | 1 | < 0.1% |
| 3175 | 2 | < 0.1% |
| 3004 | 6 | < 0.1% |
| 2824 | 15 | |
| 2788 | 4 | < 0.1% |
dividends_from_stocks
Real number (ℝ)
Skewed  Zeros 
| Distinct | 1140 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 194.21373 |
| Minimum | 0 |
|---|---|
| Maximum | 99999 |
| Zeros | 88351 |
| Zeros (%) | 89.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 400 |
| Maximum | 99999 |
| Range | 99999 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1849.8435 |
|---|---|
| Coefficient of variation (CV) | 9.5247824 |
| Kurtosis | 940.09239 |
| Mean | 194.21373 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 25.258473 |
| Sum | 19203659 |
| Variance | 3421920.9 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 88351 | |
| 100 | 571 | 0.6% |
| 500 | 506 | 0.5% |
| 200 | 481 | 0.5% |
| 1000 | 474 | 0.5% |
| 50 | 402 | 0.4% |
| 250 | 281 | 0.3% |
| 300 | 271 | 0.3% |
| 150 | 262 | 0.3% |
| 2000 | 246 | 0.2% |
| Other values (1130) | 7034 | 7.1% |
| Value | Count | Frequency (%) |
| 0 | 88351 | |
| 1 | 233 | 0.2% |
| 2 | 97 | 0.1% |
| 3 | 53 | 0.1% |
| 4 | 37 | < 0.1% |
| 5 | 84 | 0.1% |
| 6 | 48 | < 0.1% |
| 7 | 31 | < 0.1% |
| 8 | 41 | < 0.1% |
| 9 | 24 | < 0.1% |
| Value | Count | Frequency (%) |
| 99999 | 6 | |
| 90000 | 1 | < 0.1% |
| 81000 | 1 | < 0.1% |
| 75000 | 3 | < 0.1% |
| 60000 | 4 | |
| 57678 | 1 | < 0.1% |
| 55000 | 1 | < 0.1% |
| 51000 | 1 | < 0.1% |
| 50110 | 1 | < 0.1% |
| 50000 | 9 |
tax_filer_stat
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.5 MiB |
| Joint Filer | |
|---|---|
| Non-Filer | |
| Individual Filer |
Length
| Max length | 16 |
|---|---|
| Median length | 11 |
| Mean length | 11.407094 |
| Min length | 9 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Joint Filer |
|---|---|
| 2nd row | Joint Filer |
| 3rd row | Non-Filer |
| 4th row | Individual Filer |
| 5th row | Individual Filer |
Common Values
| Value | Count | Frequency (%) |
| Joint Filer | 39734 | |
| Non-Filer | 36496 | |
| Individual Filer | 22649 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| filer | 62383 | |
| joint | 39734 | |
| non-filer | 36496 | |
| individual | 22649 | 14.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 183911 | |
| l | 121528 | |
| e | 98879 | |
| r | 98879 | |
| n | 98879 | |
| F | 98879 | |
| o | 76230 | 6.8% |
| 62383 | 5.5% | |
| d | 45298 | 4.0% |
| J | 39734 | 3.5% |
| Other values (7) | 203322 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 831285 | |
| Uppercase Letter | 197758 | 17.5% |
| Space Separator | 62383 | 5.5% |
| Dash Punctuation | 36496 | 3.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 183911 | |
| l | 121528 | |
| e | 98879 | |
| r | 98879 | |
| n | 98879 | |
| o | 76230 | |
| d | 45298 | 5.4% |
| t | 39734 | 4.8% |
| v | 22649 | 2.7% |
| u | 22649 | 2.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 98879 | |
| J | 39734 | |
| N | 36496 | 18.5% |
| I | 22649 | 11.5% |
Space Separator
| Value | Count | Frequency (%) |
| 62383 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 36496 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1029043 | |
| Common | 98879 | 8.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 183911 | |
| l | 121528 | |
| e | 98879 | |
| r | 98879 | |
| n | 98879 | |
| F | 98879 | |
| o | 76230 | |
| d | 45298 | 4.4% |
| J | 39734 | 3.9% |
| t | 39734 | 3.9% |
| Other values (5) | 127092 |
Common
| Value | Count | Frequency (%) |
| 62383 | ||
| - | 36496 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1127922 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 183911 | |
| l | 121528 | |
| e | 98879 | |
| r | 98879 | |
| n | 98879 | |
| F | 98879 | |
| o | 76230 | 6.8% |
| 62383 | 5.5% | |
| d | 45298 | 4.0% |
| J | 39734 | 3.5% |
| Other values (7) | 203322 |
region_of_previous_residence
Categorical
High correlation  Imbalance 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.8 MiB |
| Not in universe | |
|---|---|
| South | 2422 |
| West | 2041 |
| Midwest | 1694 |
| Northeast | 1322 |
Length
| Max length | 16 |
|---|---|
| Median length | 16 |
| Mean length | 15.292337 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Not in universe |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 91198 | |
| South | 2422 | 2.4% |
| West | 2041 | 2.1% |
| Midwest | 1694 | 1.7% |
| Northeast | 1322 | 1.3% |
| Abroad | 202 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not | 91198 | |
| in | 91198 | |
| universe | 91198 | |
| south | 2422 | 0.9% |
| west | 2041 | 0.7% |
| midwest | 1694 | 0.6% |
| northeast | 1322 | 0.5% |
| abroad | 202 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 281275 | ||
| e | 187453 | |
| i | 184090 | |
| n | 182396 | |
| t | 99999 | 6.6% |
| s | 96255 | 6.4% |
| o | 95144 | 6.3% |
| u | 93620 | 6.2% |
| r | 92722 | 6.1% |
| N | 92520 | 6.1% |
| Other values (10) | 106617 | 7.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1131937 | |
| Space Separator | 281275 | 18.6% |
| Uppercase Letter | 98879 | 6.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 187453 | |
| i | 184090 | |
| n | 182396 | |
| t | 99999 | |
| s | 96255 | |
| o | 95144 | |
| u | 93620 | |
| r | 92722 | |
| v | 91198 | |
| h | 3744 | 0.3% |
| Other values (4) | 5316 | 0.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 92520 | |
| S | 2422 | 2.4% |
| W | 2041 | 2.1% |
| M | 1694 | 1.7% |
| A | 202 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 281275 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1230816 | |
| Common | 281275 | 18.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 187453 | |
| i | 184090 | |
| n | 182396 | |
| t | 99999 | |
| s | 96255 | |
| o | 95144 | |
| u | 93620 | |
| r | 92722 | |
| N | 92520 | |
| v | 91198 | |
| Other values (9) | 15419 | 1.3% |
Common
| Value | Count | Frequency (%) |
| 281275 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1512091 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 281275 | ||
| e | 187453 | |
| i | 184090 | |
| n | 182396 | |
| t | 99999 | 6.6% |
| s | 96255 | 6.4% |
| o | 95144 | 6.3% |
| u | 93620 | 6.2% |
| r | 92722 | 6.1% |
| N | 92520 | 6.1% |
| Other values (10) | 106617 | 7.1% |
| Distinct | 51 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.8 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 16 |
| Mean length | 15.47016 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Not in universe |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
| Value | Count | Frequency (%) |
| not | 91198 | |
| universe | 91198 | |
| in | 91198 | |
| california | 881 | 0.3% |
| north | 623 | 0.2% |
| utah | 533 | 0.2% |
| new | 489 | 0.2% |
| florida | 450 | 0.2% |
| carolina | 447 | 0.2% |
| 330 | 0.1% | |
| Other values (46) | 5379 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 282726 | ||
| i | 188701 | |
| n | 187296 | |
| e | 185188 | |
| o | 96908 | 6.3% |
| r | 95309 | 6.2% |
| t | 93880 | 6.1% |
| s | 93824 | 6.1% |
| N | 92491 | 6.0% |
| u | 91818 | 6.0% |
| Other values (36) | 121533 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1146674 | |
| Space Separator | 282726 | 18.5% |
| Uppercase Letter | 99944 | 6.5% |
| Other Punctuation | 330 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 188701 | |
| n | 187296 | |
| e | 185188 | |
| o | 96908 | |
| r | 95309 | |
| t | 93880 | |
| s | 93824 | |
| u | 91818 | |
| v | 91377 | |
| a | 9436 | 0.8% |
| Other values (14) | 12937 | 1.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 92491 | |
| C | 1563 | 1.6% |
| M | 1192 | 1.2% |
| A | 732 | 0.7% |
| U | 533 | 0.5% |
| O | 504 | 0.5% |
| I | 461 | 0.5% |
| F | 450 | 0.5% |
| D | 404 | 0.4% |
| W | 267 | 0.3% |
| Other values (10) | 1347 | 1.3% |
Space Separator
| Value | Count | Frequency (%) |
| 282726 |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 330 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1246618 | |
| Common | 283056 | 18.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 188701 | |
| n | 187296 | |
| e | 185188 | |
| o | 96908 | |
| r | 95309 | |
| t | 93880 | |
| s | 93824 | |
| N | 92491 | |
| u | 91818 | |
| v | 91377 | |
| Other values (34) | 29826 | 2.4% |
Common
| Value | Count | Frequency (%) |
| 282726 | ||
| ? | 330 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1529674 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 282726 | ||
| i | 188701 | |
| n | 187296 | |
| e | 185188 | |
| o | 96908 | 6.3% |
| r | 95309 | 6.2% |
| t | 93880 | 6.1% |
| s | 93824 | 6.1% |
| N | 92491 | 6.0% |
| u | 91818 | 6.0% |
| Other values (36) | 121533 |
detailed_household_and_family_stat
Categorical
High correlation 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.7 MiB |
| Primary Householder | |
|---|---|
| Child | |
| Extended Family | 4810 |
| Other | 3597 |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 13.780065 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Primary Householder |
|---|---|
| 2nd row | Primary Householder |
| 3rd row | Child |
| 4th row | Primary Householder |
| 5th row | Other |
Common Values
| Value | Count | Frequency (%) |
| Primary Householder | 58576 | |
| Child | 31896 | |
| Extended Family | 4810 | 4.9% |
| Other | 3597 | 3.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| primary | 58576 | |
| householder | 58576 | |
| child | 31896 | |
| extended | 4810 | 3.0% |
| family | 4810 | 3.0% |
| other | 3597 | 2.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 179325 | |
| e | 130369 | 9.6% |
| o | 117152 | 8.6% |
| d | 100092 | 7.3% |
| i | 95282 | 7.0% |
| l | 95282 | 7.0% |
| h | 94069 | 6.9% |
| m | 63386 | 4.7% |
| a | 63386 | 4.7% |
| y | 63386 | 4.7% |
| Other values (12) | 360830 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1136908 | |
| Uppercase Letter | 162265 | 11.9% |
| Space Separator | 63386 | 4.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 179325 | |
| e | 130369 | |
| o | 117152 | |
| d | 100092 | |
| i | 95282 | |
| l | 95282 | |
| h | 94069 | |
| m | 63386 | 5.6% |
| a | 63386 | 5.6% |
| y | 63386 | 5.6% |
| Other values (5) | 135179 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 58576 | |
| H | 58576 | |
| C | 31896 | |
| E | 4810 | 3.0% |
| F | 4810 | 3.0% |
| O | 3597 | 2.2% |
Space Separator
| Value | Count | Frequency (%) |
| 63386 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1299173 | |
| Common | 63386 | 4.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 179325 | |
| e | 130369 | |
| o | 117152 | 9.0% |
| d | 100092 | 7.7% |
| i | 95282 | 7.3% |
| l | 95282 | 7.3% |
| h | 94069 | 7.2% |
| m | 63386 | 4.9% |
| a | 63386 | 4.9% |
| y | 63386 | 4.9% |
| Other values (11) | 297444 |
Common
| Value | Count | Frequency (%) |
| 63386 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1362559 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 179325 | |
| e | 130369 | 9.6% |
| o | 117152 | 8.6% |
| d | 100092 | 7.3% |
| i | 95282 | 7.0% |
| l | 95282 | 7.0% |
| h | 94069 | 6.9% |
| m | 63386 | 4.7% |
| a | 63386 | 4.7% |
| y | 63386 | 4.7% |
| Other values (12) | 360830 |
detailed_household_summary_in_household
Categorical
High correlation 
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.3 MiB |
| Householder | |
|---|---|
| Child under 18 never married | |
| Spouse of householder | |
| Child 18 or older | |
| Other relative of householder | |
| Other values (3) |
Length
| Max length | 37 |
|---|---|
| Median length | 30 |
| Mean length | 20.173808 |
| Min length | 12 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Spouse of householder |
|---|---|
| 2nd row | Spouse of householder |
| 3rd row | Child under 18 never married |
| 4th row | Householder |
| 5th row | Nonrelative of householder |
Common Values
| Value | Count | Frequency (%) |
| Householder | 37939 | |
| Child under 18 never married | 24195 | |
| Spouse of householder | 20648 | |
| Child 18 or older | 7337 | 7.4% |
| Other relative of householder | 4813 | 4.9% |
| Nonrelative of householder | 3871 | 3.9% |
| Group Quarters- Secondary individual | 54 | 0.1% |
| Child under 18 ever married | 22 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| householder | 67271 | |
| child | 31554 | |
| 18 | 31554 | |
| of | 29332 | |
| under | 24217 | 8.6% |
| married | 24217 | 8.6% |
| never | 24195 | 8.6% |
| spouse | 20648 | 7.3% |
| older | 7337 | 2.6% |
| or | 7337 | 2.6% |
| Other values (8) | 13735 | 4.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 281684 | |
| 281397 | ||
| o | 203175 | |
| r | 192526 | |
| d | 154758 | |
| h | 132970 | 6.7% |
| l | 114900 | 5.8% |
| u | 112298 | 5.6% |
| s | 87973 | 4.4% |
| i | 64617 | 3.2% |
| Other values (19) | 368468 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1551220 | |
| Space Separator | 281397 | 14.1% |
| Uppercase Letter | 98987 | 5.0% |
| Decimal Number | 63108 | 3.2% |
| Dash Punctuation | 54 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 281684 | |
| o | 203175 | |
| r | 192526 | |
| d | 154758 | |
| h | 132970 | |
| l | 114900 | |
| u | 112298 | 7.2% |
| s | 87973 | 5.7% |
| i | 64617 | 4.2% |
| n | 52391 | 3.4% |
| Other values (8) | 153928 |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 37939 | |
| C | 31554 | |
| S | 20702 | |
| O | 4813 | 4.9% |
| N | 3871 | 3.9% |
| G | 54 | 0.1% |
| Q | 54 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 31554 | |
| 1 | 31554 |
Space Separator
| Value | Count | Frequency (%) |
| 281397 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 54 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1650207 | |
| Common | 344559 | 17.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 281684 | |
| o | 203175 | |
| r | 192526 | |
| d | 154758 | |
| h | 132970 | |
| l | 114900 | |
| u | 112298 | 6.8% |
| s | 87973 | 5.3% |
| i | 64617 | 3.9% |
| n | 52391 | 3.2% |
| Other values (15) | 252915 |
Common
| Value | Count | Frequency (%) |
| 281397 | ||
| 8 | 31554 | 9.2% |
| 1 | 31554 | 9.2% |
| - | 54 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1994766 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 281684 | |
| 281397 | ||
| o | 203175 | |
| r | 192526 | |
| d | 154758 | |
| h | 132970 | 6.7% |
| l | 114900 | 5.8% |
| u | 112298 | 5.6% |
| s | 87973 | 4.4% |
| i | 64617 | 3.2% |
| Other values (19) | 368468 |
instance_weight
Real number (ℝ)
| Distinct | 64741 |
|---|---|
| Distinct (%) | 65.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1741.2191 |
| Minimum | 43.26 |
|---|---|
| Maximum | 16258.2 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 43.26 |
|---|---|
| 5-th percentile | 393.667 |
| Q1 | 1058.14 |
| median | 1616.89 |
| Q3 | 2190.47 |
| 95-th percentile | 3592.968 |
| Maximum | 16258.2 |
| Range | 16214.94 |
| Interquartile range (IQR) | 1132.33 |
Descriptive statistics
| Standard deviation | 996.25211 |
|---|---|
| Coefficient of variation (CV) | 0.5721578 |
| Kurtosis | 5.5538814 |
| Mean | 1741.2191 |
| Median Absolute Deviation (MAD) | 563.4 |
| Skewness | 1.4480311 |
| Sum | 1.7217001 × 108 |
| Variance | 992518.27 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 707.9 | 22 | < 0.1% |
| 1215.77 | 19 | < 0.1% |
| 1362.16 | 19 | < 0.1% |
| 1378.71 | 19 | < 0.1% |
| 1831.77 | 16 | < 0.1% |
| 1291.46 | 15 | < 0.1% |
| 2013.21 | 15 | < 0.1% |
| 1386.38 | 15 | < 0.1% |
| 1280.35 | 15 | < 0.1% |
| 2228.01 | 15 | < 0.1% |
| Other values (64731) | 98709 |
| Value | Count | Frequency (%) |
| 43.26 | 1 | |
| 47.83 | 1 | |
| 50.38 | 1 | |
| 50.46 | 1 | |
| 52.43 | 1 | |
| 53.7 | 2 | |
| 54.88 | 1 | |
| 56.45 | 1 | |
| 58.55 | 1 | |
| 58.65 | 1 |
| Value | Count | Frequency (%) |
| 16258.2 | 1 | |
| 14547.9 | 1 | |
| 13388.6 | 1 | |
| 13145.1 | 1 | |
| 12960.2 | 1 | |
| 12739.2 | 1 | |
| 12554.3 | 1 | |
| 11688.2 | 1 | |
| 11627.5 | 1 | |
| 11254.2 | 2 |
migration_code_change_in_msa
Categorical
High correlation 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.6 MiB |
| Not in universe | |
|---|---|
| No movement | |
| MSA movement | |
| Non-MSA movement | 1342 |
| Mixed movement | 658 |
Length
| Max length | 16 |
|---|---|
| Median length | 15 |
| Mean length | 13.182283 |
| Min length | 11 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Not in universe |
| 3rd row | Not in universe |
| 4th row | No movement |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 50354 | |
| No movement | 41044 | |
| MSA movement | 5280 | 5.3% |
| Non-MSA movement | 1342 | 1.4% |
| Mixed movement | 658 | 0.7% |
| International | 201 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not | 50354 | |
| in | 50354 | |
| universe | 50354 | |
| movement | 48324 | |
| no | 41044 | |
| msa | 5280 | 2.1% |
| non-msa | 1342 | 0.5% |
| mixed | 658 | 0.3% |
| international | 201 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 198215 | |
| n | 150977 | |
| 149032 | ||
| o | 141265 | |
| i | 101567 | |
| t | 99080 | |
| v | 98678 | |
| m | 96648 | |
| N | 92740 | |
| r | 50555 | 3.9% |
| Other values (11) | 124694 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1039612 | |
| Space Separator | 149032 | 11.4% |
| Uppercase Letter | 113465 | 8.7% |
| Dash Punctuation | 1342 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 198215 | |
| n | 150977 | |
| o | 141265 | |
| i | 101567 | |
| t | 99080 | |
| v | 98678 | |
| m | 96648 | |
| r | 50555 | 4.9% |
| s | 50354 | 4.8% |
| u | 50354 | 4.8% |
| Other values (4) | 1919 | 0.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 92740 | |
| M | 7280 | 6.4% |
| S | 6622 | 5.8% |
| A | 6622 | 5.8% |
| I | 201 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 149032 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1342 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1153077 | |
| Common | 150374 | 11.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 198215 | |
| n | 150977 | |
| o | 141265 | |
| i | 101567 | |
| t | 99080 | |
| v | 98678 | |
| m | 96648 | |
| N | 92740 | |
| r | 50555 | 4.4% |
| s | 50354 | 4.4% |
| Other values (9) | 72998 | 6.3% |
Common
| Value | Count | Frequency (%) |
| 149032 | ||
| - | 1342 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1303451 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 198215 | |
| n | 150977 | |
| 149032 | ||
| o | 141265 | |
| i | 101567 | |
| t | 99080 | |
| v | 98678 | |
| m | 96648 | |
| N | 92740 | |
| r | 50555 | 3.9% |
| Other values (11) | 124694 |
migration_code_change_in_reg
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.5 MiB |
| Not in universe | |
|---|---|
| Same area | |
| Different area | 2813 |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 12.185601 |
| Min length | 9 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Not in universe |
| 3rd row | Not in universe |
| 4th row | Same area |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 50154 | |
| Same area | 45912 | |
| Different area | 2813 | 2.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not | 50154 | |
| in | 50154 | |
| universe | 50154 | |
| area | 48725 | |
| same | 45912 | |
| different | 2813 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 200571 | |
| 149033 | ||
| a | 143362 | |
| i | 103121 | |
| n | 103121 | |
| r | 101692 | |
| t | 52967 | 4.4% |
| N | 50154 | 4.2% |
| o | 50154 | 4.2% |
| u | 50154 | 4.2% |
| Other values (6) | 200571 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 956988 | |
| Space Separator | 149033 | 12.4% |
| Uppercase Letter | 98879 | 8.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 200571 | |
| a | 143362 | |
| i | 103121 | |
| n | 103121 | |
| r | 101692 | |
| t | 52967 | 5.5% |
| o | 50154 | 5.2% |
| u | 50154 | 5.2% |
| v | 50154 | 5.2% |
| s | 50154 | 5.2% |
| Other values (2) | 51538 | 5.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 50154 | |
| S | 45912 | |
| D | 2813 | 2.8% |
Space Separator
| Value | Count | Frequency (%) |
| 149033 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1055867 | |
| Common | 149033 | 12.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 200571 | |
| a | 143362 | |
| i | 103121 | |
| n | 103121 | |
| r | 101692 | |
| t | 52967 | 5.0% |
| N | 50154 | 4.8% |
| o | 50154 | 4.8% |
| u | 50154 | 4.8% |
| v | 50154 | 4.8% |
| Other values (5) | 150417 |
Common
| Value | Count | Frequency (%) |
| 149033 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1204900 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 200571 | |
| 149033 | ||
| a | 143362 | |
| i | 103121 | |
| n | 103121 | |
| r | 101692 | |
| t | 52967 | 4.4% |
| N | 50154 | 4.2% |
| o | 50154 | 4.2% |
| u | 50154 | 4.2% |
| Other values (6) | 200571 |
migration_code_move_within_reg
Categorical
High correlation  Imbalance 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.0 MiB |
| ? | |
|---|---|
| Nonmover | |
| Same county | 4868 |
| Different county same state | 1328 |
| Not in universe | 684 |
| Other values (5) | 1485 |
Length
| Max length | 29 |
|---|---|
| Median length | 2 |
| Mean length | 6.1623499 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ? |
|---|---|
| 2nd row | ? |
| 3rd row | ? |
| 4th row | Nonmover |
| 5th row | ? |
Common Values
| Value | Count | Frequency (%) |
| ? | 49470 | |
| Nonmover | 41044 | |
| Same county | 4868 | 4.9% |
| Different county same state | 1328 | 1.3% |
| Not in universe | 684 | 0.7% |
| Different state in South | 475 | 0.5% |
| Different state in West | 358 | 0.4% |
| Different state in Midwest | 242 | 0.2% |
| Different state in Northeast | 208 | 0.2% |
| Abroad | 202 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 49470 | ||
| nonmover | 41044 | |
| same | 6196 | 5.5% |
| county | 6196 | 5.5% |
| different | 2611 | 2.3% |
| state | 2611 | 2.3% |
| in | 1967 | 1.7% |
| not | 684 | 0.6% |
| universe | 684 | 0.6% |
| south | 475 | 0.4% |
| Other values (4) | 1010 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 112948 | ||
| o | 89853 | |
| e | 57249 | |
| n | 52502 | |
| ? | 49470 | |
| m | 47240 | |
| r | 44749 | 7.3% |
| N | 41936 | 6.9% |
| v | 41728 | 6.8% |
| t | 16204 | 2.7% |
| Other values (16) | 55448 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 396217 | |
| Space Separator | 112948 | 18.5% |
| Uppercase Letter | 50692 | 8.3% |
| Other Punctuation | 49470 | 8.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 89853 | |
| e | 57249 | |
| n | 52502 | |
| m | 47240 | |
| r | 44749 | |
| v | 41728 | |
| t | 16204 | 4.1% |
| a | 9217 | 2.3% |
| u | 7355 | 1.9% |
| c | 6196 | 1.6% |
| Other values (8) | 23924 | 6.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 41936 | |
| S | 5343 | 10.5% |
| D | 2611 | 5.2% |
| W | 358 | 0.7% |
| M | 242 | 0.5% |
| A | 202 | 0.4% |
Space Separator
| Value | Count | Frequency (%) |
| 112948 |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 49470 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 446909 | |
| Common | 162418 | 26.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 89853 | |
| e | 57249 | |
| n | 52502 | |
| m | 47240 | |
| r | 44749 | |
| N | 41936 | |
| v | 41728 | |
| t | 16204 | 3.6% |
| a | 9217 | 2.1% |
| u | 7355 | 1.6% |
| Other values (14) | 38876 |
Common
| Value | Count | Frequency (%) |
| 112948 | ||
| ? | 49470 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 609327 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 112948 | ||
| o | 89853 | |
| e | 57249 | |
| n | 52502 | |
| ? | 49470 | |
| m | 47240 | |
| r | 44749 | 7.3% |
| N | 41936 | 6.9% |
| v | 41728 | 6.8% |
| t | 16204 | 2.7% |
| Other values (16) | 55448 |
live_in_this_house_1_year_ago
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.3 MiB |
| Not in universe | |
|---|---|
| Yes | |
| No |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 9.5018052 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Not in universe |
| 3rd row | Not in universe |
| 4th row | Yes |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 50154 | |
| Yes | 41044 | |
| No | 7681 | 7.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not | 50154 | |
| in | 50154 | |
| universe | 50154 | |
| yes | 41044 | |
| no | 7681 | 3.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 149033 | ||
| e | 141352 | |
| i | 100308 | |
| n | 100308 | |
| s | 91198 | |
| N | 57835 | 6.2% |
| o | 57835 | 6.2% |
| t | 50154 | 5.3% |
| u | 50154 | 5.3% |
| v | 50154 | 5.3% |
| Other values (2) | 91198 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 691617 | |
| Space Separator | 149033 | 15.9% |
| Uppercase Letter | 98879 | 10.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 141352 | |
| i | 100308 | |
| n | 100308 | |
| s | 91198 | |
| o | 57835 | |
| t | 50154 | 7.3% |
| u | 50154 | 7.3% |
| v | 50154 | 7.3% |
| r | 50154 | 7.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 57835 | |
| Y | 41044 |
Space Separator
| Value | Count | Frequency (%) |
| 149033 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 790496 | |
| Common | 149033 | 15.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 141352 | |
| i | 100308 | |
| n | 100308 | |
| s | 91198 | |
| N | 57835 | |
| o | 57835 | |
| t | 50154 | 6.3% |
| u | 50154 | 6.3% |
| v | 50154 | 6.3% |
| r | 50154 | 6.3% |
Common
| Value | Count | Frequency (%) |
| 149033 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 939529 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 149033 | ||
| e | 141352 | |
| i | 100308 | |
| n | 100308 | |
| s | 91198 | |
| N | 57835 | 6.2% |
| o | 57835 | 6.2% |
| t | 50154 | 5.3% |
| u | 50154 | 5.3% |
| v | 50154 | 5.3% |
| Other values (2) | 91198 |
migration_prev_res_in_sunbelt
Categorical
High correlation  Imbalance 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.7 MiB |
| Not in universe | |
|---|---|
| No | 4803 |
| Yes | 2878 |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 14.096937 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Not in universe |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 91198 | |
| No | 4803 | 4.9% |
| Yes | 2878 | 2.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not | 91198 | |
| in | 91198 | |
| universe | 91198 | |
| no | 4803 | 1.7% |
| yes | 2878 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 190077 | ||
| e | 185274 | |
| i | 182396 | |
| n | 182396 | |
| N | 96001 | |
| o | 96001 | |
| s | 94076 | |
| t | 91198 | |
| u | 91198 | |
| v | 91198 | |
| Other values (2) | 94076 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1104935 | |
| Space Separator | 190077 | 13.6% |
| Uppercase Letter | 98879 | 7.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 185274 | |
| i | 182396 | |
| n | 182396 | |
| o | 96001 | |
| s | 94076 | |
| t | 91198 | |
| u | 91198 | |
| v | 91198 | |
| r | 91198 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 96001 | |
| Y | 2878 | 2.9% |
Space Separator
| Value | Count | Frequency (%) |
| 190077 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1203814 | |
| Common | 190077 | 13.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 185274 | |
| i | 182396 | |
| n | 182396 | |
| N | 96001 | |
| o | 96001 | |
| s | 94076 | |
| t | 91198 | |
| u | 91198 | |
| v | 91198 | |
| r | 91198 |
Common
| Value | Count | Frequency (%) |
| 190077 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1393891 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 190077 | ||
| e | 185274 | |
| i | 182396 | |
| n | 182396 | |
| N | 96001 | |
| o | 96001 | |
| s | 94076 | |
| t | 91198 | |
| u | 91198 | |
| v | 91198 | |
| Other values (2) | 94076 |
num_persons_worked_for_employer
Real number (ℝ)
High correlation  Zeros 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.9735839 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 47009 |
| Zeros (%) | 47.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.3676159 |
|---|---|
| Coefficient of variation (CV) | 1.199653 |
| Kurtosis | -1.0968823 |
| Mean | 1.9735839 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.74035049 |
| Sum | 195146 |
| Variance | 5.6056049 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 47009 | |
| 6 | 18328 | 18.5% |
| 1 | 11641 | 11.8% |
| 4 | 7059 | 7.1% |
| 3 | 6836 | 6.9% |
| 2 | 5079 | 5.1% |
| 5 | 2927 | 3.0% |
| Value | Count | Frequency (%) |
| 0 | 47009 | |
| 1 | 11641 | 11.8% |
| 2 | 5079 | 5.1% |
| 3 | 6836 | 6.9% |
| 4 | 7059 | 7.1% |
| 5 | 2927 | 3.0% |
| 6 | 18328 | 18.5% |
| Value | Count | Frequency (%) |
| 6 | 18328 | 18.5% |
| 5 | 2927 | 3.0% |
| 4 | 7059 | 7.1% |
| 3 | 6836 | 6.9% |
| 2 | 5079 | 5.1% |
| 1 | 11641 | 11.8% |
| 0 | 47009 |
family_members_under_18
Categorical
High correlation  Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.0 MiB |
| Not in universe | |
|---|---|
| Both parents present | |
| Mother only present | 6181 |
| Father only present | 941 |
| Neither parent present | 805 |
Length
| Max length | 23 |
|---|---|
| Median length | 16 |
| Mean length | 17.284631 |
| Min length | 16 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Not in universe |
| 3rd row | Both parents present |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 72372 | |
| Both parents present | 18580 | 18.8% |
| Mother only present | 6181 | 6.3% |
| Father only present | 941 | 1.0% |
| Neither parent present | 805 | 0.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not | 72372 | |
| in | 72372 | |
| universe | 72372 | |
| present | 26507 | 8.9% |
| both | 18580 | 6.3% |
| parents | 18580 | 6.3% |
| only | 7122 | 2.4% |
| mother | 6181 | 2.1% |
| father | 941 | 0.3% |
| neither | 805 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 296637 | ||
| e | 225875 | |
| n | 197758 | |
| i | 145549 | |
| t | 144771 | |
| r | 126191 | |
| s | 117459 | 6.9% |
| o | 104255 | 6.1% |
| N | 73177 | 4.3% |
| u | 72372 | 4.2% |
| Other values (9) | 205043 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1313571 | |
| Space Separator | 296637 | 17.4% |
| Uppercase Letter | 98879 | 5.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 225875 | |
| n | 197758 | |
| i | 145549 | |
| t | 144771 | |
| r | 126191 | |
| s | 117459 | |
| o | 104255 | |
| u | 72372 | 5.5% |
| v | 72372 | 5.5% |
| p | 45892 | 3.5% |
| Other values (4) | 61077 | 4.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 73177 | |
| B | 18580 | 18.8% |
| M | 6181 | 6.3% |
| F | 941 | 1.0% |
Space Separator
| Value | Count | Frequency (%) |
| 296637 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1412450 | |
| Common | 296637 | 17.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 225875 | |
| n | 197758 | |
| i | 145549 | |
| t | 144771 | |
| r | 126191 | |
| s | 117459 | |
| o | 104255 | |
| N | 73177 | 5.2% |
| u | 72372 | 5.1% |
| v | 72372 | 5.1% |
| Other values (8) | 132671 |
Common
| Value | Count | Frequency (%) |
| 296637 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1709087 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 296637 | ||
| e | 225875 | |
| n | 197758 | |
| i | 145549 | |
| t | 144771 | |
| r | 126191 | |
| s | 117459 | 6.9% |
| o | 104255 | 6.1% |
| N | 73177 | 4.3% |
| u | 72372 | 4.2% |
| Other values (9) | 205043 |
country_of_birth_father
Categorical
High correlation  Imbalance 
| Distinct | 43 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.6 MiB |
| United-States | |
|---|---|
| Mexico | 5030 |
| ? | 3425 |
| Puerto-Rico | 1285 |
| Italy | 1119 |
| Other values (38) |
Length
| Max length | 29 |
|---|---|
| Median length | 14 |
| Mean length | 12.644738 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Mexico |
|---|---|
| 2nd row | United-States |
| 3rd row | United-States |
| 4th row | United-States |
| 5th row | United-States |
Common Values
| Value | Count | Frequency (%) |
| United-States | 78519 | |
| Mexico | 5030 | 5.1% |
| ? | 3425 | 3.5% |
| Puerto-Rico | 1285 | 1.3% |
| Italy | 1119 | 1.1% |
| Dominican-Republic | 681 | 0.7% |
| Canada | 659 | 0.7% |
| Germany | 648 | 0.7% |
| Poland | 629 | 0.6% |
| Philippines | 591 | 0.6% |
| Other values (33) | 6293 | 6.4% |
Length
| Value | Count | Frequency (%) |
| united-states | 78519 | |
| mexico | 5030 | 5.1% |
| 3425 | 3.4% | |
| puerto-rico | 1285 | 1.3% |
| italy | 1119 | 1.1% |
| dominican-republic | 681 | 0.7% |
| canada | 659 | 0.7% |
| germany | 648 | 0.7% |
| poland | 629 | 0.6% |
| philippines | 591 | 0.6% |
| Other values (39) | 6904 | 6.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 239312 | |
| e | 167264 | |
| 99490 | ||
| a | 92045 | 7.4% |
| i | 91125 | 7.3% |
| n | 85629 | 6.8% |
| d | 82073 | 6.6% |
| - | 81083 | 6.5% |
| S | 79547 | 6.4% |
| s | 79441 | 6.4% |
| Other values (37) | 153290 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 888711 | |
| Uppercase Letter | 177369 | 14.2% |
| Space Separator | 99490 | 8.0% |
| Dash Punctuation | 81083 | 6.5% |
| Other Punctuation | 3494 | 0.3% |
| Open Punctuation | 76 | < 0.1% |
| Close Punctuation | 76 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 239312 | |
| e | 167264 | |
| a | 92045 | 10.4% |
| i | 91125 | 10.3% |
| n | 85629 | 9.6% |
| d | 82073 | 9.2% |
| s | 79441 | 8.9% |
| o | 11411 | 1.3% |
| c | 8784 | 1.0% |
| l | 5843 | 0.7% |
| Other values (11) | 25784 | 2.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 79547 | |
| U | 78671 | |
| M | 5030 | 2.8% |
| P | 2876 | 1.6% |
| C | 2048 | 1.2% |
| R | 1966 | 1.1% |
| I | 1912 | 1.1% |
| G | 1161 | 0.7% |
| E | 1076 | 0.6% |
| D | 681 | 0.4% |
| Other values (10) | 2401 | 1.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 3425 | |
| & | 69 | 2.0% |
Space Separator
| Value | Count | Frequency (%) |
| 99490 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 81083 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 76 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 76 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1066080 | |
| Common | 184219 | 14.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 239312 | |
| e | 167264 | |
| a | 92045 | 8.6% |
| i | 91125 | 8.5% |
| n | 85629 | 8.0% |
| d | 82073 | 7.7% |
| S | 79547 | 7.5% |
| s | 79441 | 7.5% |
| U | 78671 | 7.4% |
| o | 11411 | 1.1% |
| Other values (31) | 59562 | 5.6% |
Common
| Value | Count | Frequency (%) |
| 99490 | ||
| - | 81083 | |
| ? | 3425 | 1.9% |
| ( | 76 | < 0.1% |
| ) | 76 | < 0.1% |
| & | 69 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1250299 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 239312 | |
| e | 167264 | |
| 99490 | ||
| a | 92045 | 7.4% |
| i | 91125 | 7.3% |
| n | 85629 | 6.8% |
| d | 82073 | 6.6% |
| - | 81083 | 6.5% |
| S | 79547 | 6.4% |
| s | 79441 | 6.4% |
| Other values (37) | 153290 |
country_of_birth_mother
Categorical
High correlation  Imbalance 
| Distinct | 43 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.6 MiB |
| United-States | |
|---|---|
| Mexico | 4969 |
| ? | 3070 |
| Puerto-Rico | 1190 |
| Italy | 917 |
| Other values (38) |
Length
| Max length | 29 |
|---|---|
| Median length | 14 |
| Mean length | 12.69044 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Mexico |
|---|---|
| 2nd row | United-States |
| 3rd row | United-States |
| 4th row | United-States |
| 5th row | United-States |
Common Values
| Value | Count | Frequency (%) |
| United-States | 79165 | |
| Mexico | 4969 | 5.0% |
| ? | 3070 | 3.1% |
| Puerto-Rico | 1190 | 1.2% |
| Italy | 917 | 0.9% |
| Canada | 702 | 0.7% |
| Germany | 676 | 0.7% |
| Philippines | 644 | 0.7% |
| Cuba | 594 | 0.6% |
| Poland | 585 | 0.6% |
| Other values (33) | 6367 | 6.4% |
Length
| Value | Count | Frequency (%) |
| united-states | 79165 | |
| mexico | 4969 | 5.0% |
| 3070 | 3.1% | |
| puerto-rico | 1190 | 1.2% |
| italy | 917 | 0.9% |
| canada | 702 | 0.7% |
| germany | 676 | 0.7% |
| philippines | 644 | 0.6% |
| cuba | 594 | 0.6% |
| poland | 585 | 0.6% |
| Other values (39) | 6961 | 7.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 240965 | |
| e | 168270 | |
| 99473 | ||
| a | 92713 | 7.4% |
| i | 91332 | 7.3% |
| n | 86249 | 6.9% |
| d | 82819 | 6.6% |
| - | 81508 | 6.5% |
| S | 80234 | 6.4% |
| s | 80116 | 6.4% |
| Other values (37) | 151139 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 892492 | |
| Uppercase Letter | 178093 | 14.2% |
| Space Separator | 99473 | 7.9% |
| Dash Punctuation | 81508 | 6.5% |
| Other Punctuation | 3128 | 0.2% |
| Open Punctuation | 62 | < 0.1% |
| Close Punctuation | 62 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 240965 | |
| e | 168270 | |
| a | 92713 | 10.4% |
| i | 91332 | 10.2% |
| n | 86249 | 9.7% |
| d | 82819 | 9.3% |
| s | 80116 | 9.0% |
| o | 11006 | 1.2% |
| c | 8292 | 0.9% |
| l | 5603 | 0.6% |
| Other values (11) | 25127 | 2.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 80234 | |
| U | 79289 | |
| M | 4969 | 2.8% |
| P | 2784 | 1.6% |
| C | 2054 | 1.2% |
| R | 1728 | 1.0% |
| I | 1721 | 1.0% |
| E | 1161 | 0.7% |
| G | 1122 | 0.6% |
| D | 538 | 0.3% |
| Other values (10) | 2493 | 1.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 3070 | |
| & | 58 | 1.9% |
Space Separator
| Value | Count | Frequency (%) |
| 99473 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 81508 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 62 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 62 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1070585 | |
| Common | 184233 | 14.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 240965 | |
| e | 168270 | |
| a | 92713 | 8.7% |
| i | 91332 | 8.5% |
| n | 86249 | 8.1% |
| d | 82819 | 7.7% |
| S | 80234 | 7.5% |
| s | 80116 | 7.5% |
| U | 79289 | 7.4% |
| o | 11006 | 1.0% |
| Other values (31) | 57592 | 5.4% |
Common
| Value | Count | Frequency (%) |
| 99473 | ||
| - | 81508 | |
| ? | 3070 | 1.7% |
| ( | 62 | < 0.1% |
| ) | 62 | < 0.1% |
| & | 58 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1254818 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 240965 | |
| e | 168270 | |
| 99473 | ||
| a | 92713 | 7.4% |
| i | 91332 | 7.3% |
| n | 86249 | 6.9% |
| d | 82819 | 6.6% |
| - | 81508 | 6.5% |
| S | 80234 | 6.4% |
| s | 80116 | 6.4% |
| Other values (37) | 151139 |
country_of_birth_self
Categorical
High correlation  Imbalance 
| Distinct | 43 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.6 MiB |
| United-States | |
|---|---|
| Mexico | 2934 |
| ? | 1763 |
| Puerto-Rico | 691 |
| Philippines | 454 |
| Other values (38) | 5559 |
Length
| Max length | 29 |
|---|---|
| Median length | 14 |
| Mean length | 13.259337 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Mexico |
|---|---|
| 2nd row | United-States |
| 3rd row | United-States |
| 4th row | United-States |
| 5th row | United-States |
Common Values
| Value | Count | Frequency (%) |
| United-States | 87478 | |
| Mexico | 2934 | 3.0% |
| ? | 1763 | 1.8% |
| Puerto-Rico | 691 | 0.7% |
| Philippines | 454 | 0.5% |
| Cuba | 428 | 0.4% |
| Germany | 420 | 0.4% |
| Canada | 346 | 0.3% |
| El-Salvador | 342 | 0.3% |
| Dominican-Republic | 327 | 0.3% |
| Other values (33) | 3696 | 3.7% |
Length
| Value | Count | Frequency (%) |
| united-states | 87478 | |
| mexico | 2934 | 3.0% |
| 1763 | 1.8% | |
| puerto-rico | 691 | 0.7% |
| philippines | 454 | 0.5% |
| cuba | 428 | 0.4% |
| germany | 420 | 0.4% |
| canada | 346 | 0.3% |
| el-salvador | 342 | 0.3% |
| dominican-republic | 327 | 0.3% |
| Other values (39) | 4168 | 4.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 264267 | |
| e | 180908 | |
| 99351 | 7.6% | |
| a | 95334 | 7.3% |
| i | 95149 | 7.3% |
| n | 91503 | 7.0% |
| d | 89343 | 6.8% |
| - | 88895 | 6.8% |
| S | 88183 | 6.7% |
| s | 88122 | 6.7% |
| Other values (37) | 130015 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 934298 | |
| Uppercase Letter | 186623 | 14.2% |
| Space Separator | 99351 | 7.6% |
| Dash Punctuation | 88895 | 6.8% |
| Other Punctuation | 1811 | 0.1% |
| Open Punctuation | 46 | < 0.1% |
| Close Punctuation | 46 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 264267 | |
| e | 180908 | |
| a | 95334 | 10.2% |
| i | 95149 | 10.2% |
| n | 91503 | 9.8% |
| d | 89343 | 9.6% |
| s | 88122 | 9.4% |
| o | 6555 | 0.7% |
| c | 4933 | 0.5% |
| x | 2934 | 0.3% |
| Other values (11) | 15250 | 1.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 88183 | |
| U | 87570 | |
| M | 2934 | 1.6% |
| P | 1542 | 0.8% |
| C | 1285 | 0.7% |
| R | 1018 | 0.5% |
| G | 707 | 0.4% |
| E | 701 | 0.4% |
| I | 627 | 0.3% |
| J | 354 | 0.2% |
| Other values (10) | 1702 | 0.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 1763 | |
| & | 48 | 2.7% |
Space Separator
| Value | Count | Frequency (%) |
| 99351 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 88895 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 46 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 46 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1120921 | |
| Common | 190149 | 14.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 264267 | |
| e | 180908 | |
| a | 95334 | 8.5% |
| i | 95149 | 8.5% |
| n | 91503 | 8.2% |
| d | 89343 | 8.0% |
| S | 88183 | 7.9% |
| s | 88122 | 7.9% |
| U | 87570 | 7.8% |
| o | 6555 | 0.6% |
| Other values (31) | 33987 | 3.0% |
Common
| Value | Count | Frequency (%) |
| 99351 | ||
| - | 88895 | |
| ? | 1763 | 0.9% |
| & | 48 | < 0.1% |
| ( | 46 | < 0.1% |
| ) | 46 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1311070 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 264267 | |
| e | 180908 | |
| 99351 | 7.6% | |
| a | 95334 | 7.3% |
| i | 95149 | 7.3% |
| n | 91503 | 7.0% |
| d | 89343 | 6.8% |
| - | 88895 | 6.8% |
| S | 88183 | 6.7% |
| s | 88122 | 6.7% |
| Other values (37) | 130015 |
citizenship
Categorical
High correlation  Imbalance 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.0 MiB |
| Native | |
|---|---|
| Foreign | 6699 |
| Naturalized | 3012 |
Length
| Max length | 11 |
|---|---|
| Median length | 6 |
| Mean length | 6.2200568 |
| Min length | 6 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Foreign |
|---|---|
| 2nd row | Native |
| 3rd row | Native |
| 4th row | Native |
| 5th row | Native |
Common Values
| Value | Count | Frequency (%) |
| Native | 89168 | |
| Foreign | 6699 | 6.8% |
| Naturalized | 3012 | 3.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| native | 89168 | |
| foreign | 6699 | 6.8% |
| naturalized | 3012 | 3.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 98879 | |
| e | 98879 | |
| a | 95192 | |
| N | 92180 | |
| t | 92180 | |
| v | 89168 | |
| r | 9711 | 1.6% |
| F | 6699 | 1.1% |
| o | 6699 | 1.1% |
| g | 6699 | 1.1% |
| Other values (5) | 18747 | 3.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 516154 | |
| Uppercase Letter | 98879 | 16.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 98879 | |
| e | 98879 | |
| a | 95192 | |
| t | 92180 | |
| v | 89168 | |
| r | 9711 | 1.9% |
| o | 6699 | 1.3% |
| g | 6699 | 1.3% |
| n | 6699 | 1.3% |
| u | 3012 | 0.6% |
| Other values (3) | 9036 | 1.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 92180 | |
| F | 6699 | 6.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 615033 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 98879 | |
| e | 98879 | |
| a | 95192 | |
| N | 92180 | |
| t | 92180 | |
| v | 89168 | |
| r | 9711 | 1.6% |
| F | 6699 | 1.1% |
| o | 6699 | 1.1% |
| g | 6699 | 1.1% |
| Other values (5) | 18747 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 615033 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 98879 | |
| e | 98879 | |
| a | 95192 | |
| N | 92180 | |
| t | 92180 | |
| v | 89168 | |
| r | 9711 | 1.6% |
| F | 6699 | 1.1% |
| o | 6699 | 1.1% |
| g | 6699 | 1.1% |
| Other values (5) | 18747 | 3.0% |
own_business_or_self_employed
Categorical
Imbalance 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.7 MiB |
| Not in universe | |
|---|---|
| No | 8234 |
| Yes | 1340 |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 13.754822 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Not in universe |
| 3rd row | Not in universe |
| 4th row | No |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 89305 | |
| No | 8234 | 8.3% |
| Yes | 1340 | 1.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not | 89305 | |
| in | 89305 | |
| universe | 89305 | |
| no | 8234 | 3.0% |
| yes | 1340 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 179950 | |
| 178610 | ||
| i | 178610 | |
| n | 178610 | |
| N | 97539 | |
| o | 97539 | |
| s | 90645 | |
| t | 89305 | |
| u | 89305 | |
| v | 89305 | |
| Other values (2) | 90645 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1082574 | |
| Space Separator | 178610 | 13.1% |
| Uppercase Letter | 98879 | 7.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 179950 | |
| i | 178610 | |
| n | 178610 | |
| o | 97539 | |
| s | 90645 | |
| t | 89305 | |
| u | 89305 | |
| v | 89305 | |
| r | 89305 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 97539 | |
| Y | 1340 | 1.4% |
Space Separator
| Value | Count | Frequency (%) |
| 178610 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1181453 | |
| Common | 178610 | 13.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 179950 | |
| i | 178610 | |
| n | 178610 | |
| N | 97539 | |
| o | 97539 | |
| s | 90645 | |
| t | 89305 | |
| u | 89305 | |
| v | 89305 | |
| r | 89305 |
Common
| Value | Count | Frequency (%) |
| 178610 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1360063 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 179950 | |
| 178610 | ||
| i | 178610 | |
| n | 178610 | |
| N | 97539 | |
| o | 97539 | |
| s | 90645 | |
| t | 89305 | |
| u | 89305 | |
| v | 89305 | |
| Other values (2) | 90645 |
fill_inc_questionnaire_for_veteran's_admin
Categorical
High correlation  Imbalance 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.9 MiB |
| Not in universe | |
|---|---|
| No | 828 |
| Yes | 199 |
Length
| Max length | 16 |
|---|---|
| Median length | 16 |
| Mean length | 15.866989 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Not in universe |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 97852 | |
| No | 828 | 0.8% |
| Yes | 199 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not | 97852 | |
| in | 97852 | |
| universe | 97852 | |
| no | 828 | 0.3% |
| yes | 199 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 294583 | ||
| e | 195903 | |
| i | 195704 | |
| n | 195704 | |
| N | 98680 | 6.3% |
| o | 98680 | 6.3% |
| s | 98051 | 6.2% |
| t | 97852 | 6.2% |
| u | 97852 | 6.2% |
| v | 97852 | 6.2% |
| Other values (2) | 98051 | 6.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1175450 | |
| Space Separator | 294583 | 18.8% |
| Uppercase Letter | 98879 | 6.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 195903 | |
| i | 195704 | |
| n | 195704 | |
| o | 98680 | |
| s | 98051 | |
| t | 97852 | |
| u | 97852 | |
| v | 97852 | |
| r | 97852 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 98680 | |
| Y | 199 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 294583 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1274329 | |
| Common | 294583 | 18.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 195903 | |
| i | 195704 | |
| n | 195704 | |
| N | 98680 | |
| o | 98680 | |
| s | 98051 | |
| t | 97852 | |
| u | 97852 | |
| v | 97852 | |
| r | 97852 |
Common
| Value | Count | Frequency (%) |
| 294583 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1568912 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 294583 | ||
| e | 195903 | |
| i | 195704 | |
| n | 195704 | |
| N | 98680 | 6.3% |
| o | 98680 | 6.3% |
| s | 98051 | 6.2% |
| t | 97852 | 6.2% |
| u | 97852 | 6.2% |
| v | 97852 | 6.2% |
| Other values (2) | 98051 | 6.2% |
veterans_benefits
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.6 MiB |
| Not a Veteran | |
|---|---|
| Not in universe | |
| Veteran | 1027 |
Length
| Max length | 15 |
|---|---|
| Median length | 13 |
| Mean length | 13.394725 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not a Veteran |
|---|---|
| 2nd row | Not a Veteran |
| 3rd row | Not in universe |
| 4th row | Not a Veteran |
| 5th row | Not a Veteran |
Common Values
| Value | Count | Frequency (%) |
| Not a Veteran | 75256 | |
| Not in universe | 22596 | 22.9% |
| Veteran | 1027 | 1.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not | 97852 | |
| veteran | 76283 | |
| a | 75256 | |
| in | 22596 | 7.7% |
| universe | 22596 | 7.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 197758 | |
| 195704 | ||
| t | 174135 | |
| a | 151539 | |
| n | 121475 | |
| r | 98879 | |
| N | 97852 | |
| o | 97852 | |
| V | 76283 | 5.8% |
| i | 45192 | 3.4% |
| Other values (3) | 67788 | 5.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 954618 | |
| Space Separator | 195704 | 14.8% |
| Uppercase Letter | 174135 | 13.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 197758 | |
| t | 174135 | |
| a | 151539 | |
| n | 121475 | |
| r | 98879 | |
| o | 97852 | |
| i | 45192 | 4.7% |
| u | 22596 | 2.4% |
| v | 22596 | 2.4% |
| s | 22596 | 2.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 97852 | |
| V | 76283 |
Space Separator
| Value | Count | Frequency (%) |
| 195704 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1128753 | |
| Common | 195704 | 14.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 197758 | |
| t | 174135 | |
| a | 151539 | |
| n | 121475 | |
| r | 98879 | |
| N | 97852 | |
| o | 97852 | |
| V | 76283 | 6.8% |
| i | 45192 | 4.0% |
| u | 22596 | 2.0% |
| Other values (2) | 45192 | 4.0% |
Common
| Value | Count | Frequency (%) |
| 195704 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1324457 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 197758 | |
| 195704 | ||
| t | 174135 | |
| a | 151539 | |
| n | 121475 | |
| r | 98879 | |
| N | 97852 | |
| o | 97852 | |
| V | 76283 | 5.8% |
| i | 45192 | 3.4% |
| Other values (3) | 67788 | 5.1% |
weeks_worked_in_year
Real number (ℝ)
High correlation  Zeros 
| Distinct | 53 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.391226 |
| Minimum | 0 |
|---|---|
| Maximum | 52 |
| Zeros | 47009 |
| Zeros (%) | 47.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 10 |
| Q3 | 52 |
| 95-th percentile | 52 |
| Maximum | 52 |
| Range | 52 |
| Interquartile range (IQR) | 52 |
Descriptive statistics
| Standard deviation | 24.398786 |
|---|---|
| Coefficient of variation (CV) | 1.0430743 |
| Kurtosis | -1.8681688 |
| Mean | 23.391226 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 0.19363319 |
| Sum | 2312901 |
| Variance | 595.30075 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 47009 | |
| 52 | 35051 | |
| 40 | 1362 | 1.4% |
| 26 | 1125 | 1.1% |
| 50 | 1109 | 1.1% |
| 48 | 1010 | 1.0% |
| 12 | 952 | 1.0% |
| 20 | 725 | 0.7% |
| 30 | 725 | 0.7% |
| 8 | 564 | 0.6% |
| Other values (43) | 9247 | 9.4% |
| Value | Count | Frequency (%) |
| 0 | 47009 | |
| 1 | 226 | 0.2% |
| 2 | 209 | 0.2% |
| 3 | 207 | 0.2% |
| 4 | 361 | 0.4% |
| 5 | 127 | 0.1% |
| 6 | 315 | 0.3% |
| 7 | 71 | 0.1% |
| 8 | 564 | 0.6% |
| 9 | 126 | 0.1% |
| Value | Count | Frequency (%) |
| 52 | 35051 | |
| 51 | 413 | 0.4% |
| 50 | 1109 | 1.1% |
| 49 | 308 | 0.3% |
| 48 | 1010 | 1.0% |
| 47 | 124 | 0.1% |
| 46 | 302 | 0.3% |
| 45 | 349 | 0.4% |
| 44 | 459 | 0.5% |
| 43 | 188 | 0.2% |
year
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.8 MiB |
| 1995 | |
|---|---|
| 1994 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1995 |
|---|---|
| 2nd row | 1995 |
| 3rd row | 1995 |
| 4th row | 1994 |
| 5th row | 1995 |
Common Values
| Value | Count | Frequency (%) |
| 1995 | 49470 | |
| 1994 | 49409 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1995 | 49470 | |
| 1994 | 49409 |
Most occurring characters
| Value | Count | Frequency (%) |
| 9 | 197758 | |
| 1 | 98879 | |
| 5 | 49470 | 12.5% |
| 4 | 49409 | 12.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 395516 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 197758 | |
| 1 | 98879 | |
| 5 | 49470 | 12.5% |
| 4 | 49409 | 12.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 395516 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 9 | 197758 | |
| 1 | 98879 | |
| 5 | 49470 | 12.5% |
| 4 | 49409 | 12.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 395516 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 9 | 197758 | |
| 1 | 98879 | |
| 5 | 49470 | 12.5% |
| 4 | 49409 | 12.5% |
target
Categorical
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
| 1 | |
|---|---|
| 0 | 6186 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 92693 | |
| 0 | 6186 | 6.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 92693 | |
| 0 | 6186 | 6.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 92693 | |
| 0 | 6186 | 6.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 98879 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 92693 | |
| 0 | 6186 | 6.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 98879 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 92693 | |
| 0 | 6186 | 6.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 98879 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 92693 | |
| 0 | 6186 | 6.3% |
Interactions
Correlations
| age | capital_gains | capital_losses | citizenship | class_of_worker | country_of_birth_father | country_of_birth_mother | country_of_birth_self | detailed_household_and_family_stat | detailed_household_summary_in_household | detailed_industry_recode | detailed_occupation_recode | dividends_from_stocks | education | enroll_in_edu_inst_last_wk | family_members_under_18 | fill_inc_questionnaire_for_veteran's_admin | full_or_part_time_employment_stat | hispanic_origin | instance_weight | live_in_this_house_1_year_ago | major_industry_code | major_occupation_code | marital_stat | member_of_a_labor_union | migration_code_change_in_msa | migration_code_change_in_reg | migration_code_move_within_reg | migration_prev_res_in_sunbelt | num_persons_worked_for_employer | own_business_or_self_employed | race | reason_for_unemployment | region_of_previous_residence | sex | target | tax_filer_stat | veterans_benefits | wage_per_hour | weeks_worked_in_year | year | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| age | 1.000 | 0.126 | 0.065 | 0.125 | 0.366 | 0.092 | 0.089 | 0.065 | 0.508 | 0.398 | 0.245 | 0.251 | 0.246 | 0.444 | 0.430 | 0.481 | 0.086 | 0.317 | 0.051 | 0.009 | 0.115 | 0.245 | 0.245 | 0.428 | 0.171 | 0.073 | 0.067 | 0.085 | 0.108 | 0.225 | 0.190 | 0.054 | 0.076 | 0.069 | 0.063 | 0.246 | 0.588 | 0.643 | 0.037 | 0.267 | 0.009 |
| capital_gains | 0.126 | 1.000 | -0.028 | 0.015 | 0.048 | 0.018 | 0.020 | 0.016 | 0.040 | 0.050 | 0.051 | 0.093 | 0.112 | 0.072 | 0.022 | 0.029 | 0.000 | 0.029 | 0.008 | 0.010 | 0.000 | 0.050 | 0.062 | 0.029 | 0.015 | 0.000 | 0.000 | 0.000 | 0.004 | 0.114 | 0.023 | 0.011 | 0.000 | 0.002 | 0.056 | 0.310 | 0.058 | 0.037 | 0.008 | 0.126 | 0.000 |
| capital_losses | 0.065 | -0.028 | 1.000 | 0.015 | 0.049 | 0.000 | 0.000 | 0.000 | 0.045 | 0.054 | 0.040 | 0.048 | 0.065 | 0.054 | 0.029 | 0.038 | 0.006 | 0.032 | 0.007 | 0.007 | 0.007 | 0.038 | 0.040 | 0.047 | 0.026 | 0.004 | 0.003 | 0.011 | 0.008 | 0.094 | 0.024 | 0.010 | 0.000 | 0.007 | 0.076 | 0.175 | 0.095 | 0.053 | 0.011 | 0.104 | 0.000 |
| citizenship | 0.125 | 0.015 | 0.015 | 1.000 | 0.057 | 0.538 | 0.542 | 0.719 | 0.110 | 0.113 | 0.099 | 0.121 | 0.000 | 0.139 | 0.019 | 0.090 | 0.017 | 0.043 | 0.395 | 0.055 | 0.032 | 0.086 | 0.099 | 0.104 | 0.012 | 0.078 | 0.015 | 0.075 | 0.033 | 0.042 | 0.021 | 0.244 | 0.029 | 0.075 | 0.004 | 0.044 | 0.056 | 0.088 | 0.017 | 0.033 | 0.006 |
| class_of_worker | 0.366 | 0.048 | 0.049 | 0.057 | 1.000 | 0.054 | 0.053 | 0.055 | 0.260 | 0.277 | 0.637 | 0.597 | 0.013 | 0.319 | 0.082 | 0.276 | 0.030 | 0.363 | 0.049 | 0.023 | 0.037 | 0.650 | 0.548 | 0.212 | 0.277 | 0.027 | 0.022 | 0.050 | 0.035 | 0.510 | 0.210 | 0.047 | 0.438 | 0.027 | 0.123 | 0.234 | 0.487 | 0.389 | 0.083 | 0.448 | 0.006 |
| country_of_birth_father | 0.092 | 0.018 | 0.000 | 0.538 | 0.054 | 1.000 | 0.781 | 0.668 | 0.095 | 0.067 | 0.029 | 0.036 | 0.000 | 0.114 | 0.037 | 0.071 | 0.017 | 0.058 | 0.540 | 0.057 | 0.036 | 0.034 | 0.049 | 0.089 | 0.041 | 0.046 | 0.029 | 0.033 | 0.049 | 0.043 | 0.047 | 0.442 | 0.021 | 0.054 | 0.024 | 0.072 | 0.077 | 0.079 | 0.000 | 0.030 | 0.032 |
| country_of_birth_mother | 0.089 | 0.020 | 0.000 | 0.542 | 0.053 | 0.781 | 1.000 | 0.683 | 0.090 | 0.064 | 0.029 | 0.036 | 0.000 | 0.111 | 0.038 | 0.067 | 0.014 | 0.055 | 0.546 | 0.056 | 0.035 | 0.034 | 0.048 | 0.086 | 0.040 | 0.048 | 0.027 | 0.034 | 0.052 | 0.041 | 0.045 | 0.446 | 0.023 | 0.056 | 0.022 | 0.070 | 0.073 | 0.074 | 0.000 | 0.028 | 0.029 |
| country_of_birth_self | 0.065 | 0.016 | 0.000 | 0.719 | 0.055 | 0.668 | 0.683 | 1.000 | 0.091 | 0.066 | 0.036 | 0.044 | 0.000 | 0.125 | 0.034 | 0.064 | 0.000 | 0.043 | 0.485 | 0.046 | 0.030 | 0.040 | 0.057 | 0.074 | 0.028 | 0.049 | 0.023 | 0.037 | 0.043 | 0.037 | 0.025 | 0.381 | 0.031 | 0.056 | 0.027 | 0.058 | 0.061 | 0.088 | 0.000 | 0.022 | 0.020 |
| detailed_household_and_family_stat | 0.508 | 0.040 | 0.045 | 0.110 | 0.260 | 0.095 | 0.090 | 0.091 | 1.000 | 0.982 | 0.265 | 0.275 | 0.027 | 0.428 | 0.205 | 0.527 | 0.049 | 0.181 | 0.080 | 0.037 | 0.068 | 0.266 | 0.268 | 0.489 | 0.107 | 0.057 | 0.038 | 0.084 | 0.065 | 0.258 | 0.095 | 0.075 | 0.040 | 0.055 | 0.054 | 0.196 | 0.536 | 0.505 | 0.052 | 0.274 | 0.000 |
| detailed_household_summary_in_household | 0.398 | 0.050 | 0.054 | 0.113 | 0.277 | 0.067 | 0.064 | 0.066 | 0.982 | 1.000 | 0.220 | 0.235 | 0.018 | 0.398 | 0.339 | 0.529 | 0.070 | 0.217 | 0.054 | 0.035 | 0.068 | 0.221 | 0.224 | 0.419 | 0.129 | 0.046 | 0.037 | 0.062 | 0.063 | 0.229 | 0.138 | 0.072 | 0.062 | 0.043 | 0.373 | 0.224 | 0.663 | 0.606 | 0.036 | 0.223 | 0.000 |
| detailed_industry_recode | 0.245 | 0.051 | 0.040 | 0.099 | 0.637 | 0.029 | 0.029 | 0.036 | 0.265 | 0.220 | 1.000 | 0.426 | 0.004 | 0.320 | 0.127 | 0.276 | 0.039 | 0.361 | 0.051 | 0.026 | 0.034 | 0.916 | 0.599 | 0.194 | 0.260 | 0.029 | 0.017 | 0.033 | 0.036 | 0.405 | 0.212 | 0.056 | 0.148 | 0.029 | 0.305 | 0.283 | 0.489 | 0.388 | 0.069 | 0.306 | 0.006 |
| detailed_occupation_recode | 0.251 | 0.093 | 0.048 | 0.121 | 0.597 | 0.036 | 0.036 | 0.044 | 0.275 | 0.235 | 0.426 | 1.000 | 0.014 | 0.403 | 0.159 | 0.277 | 0.043 | 0.361 | 0.064 | 0.022 | 0.039 | 0.566 | 1.000 | 0.203 | 0.261 | 0.031 | 0.025 | 0.035 | 0.039 | 0.395 | 0.220 | 0.069 | 0.157 | 0.028 | 0.392 | 0.437 | 0.498 | 0.388 | 0.079 | 0.310 | 0.008 |
| dividends_from_stocks | 0.246 | 0.112 | 0.065 | 0.000 | 0.013 | 0.000 | 0.000 | 0.000 | 0.027 | 0.018 | 0.004 | 0.014 | 1.000 | 0.039 | 0.003 | 0.018 | 0.007 | 0.018 | 0.000 | 0.010 | 0.007 | 0.011 | 0.010 | 0.024 | 0.000 | 0.000 | 0.001 | 0.000 | 0.003 | 0.151 | 0.010 | 0.007 | 0.000 | 0.000 | 0.009 | 0.145 | 0.036 | 0.025 | -0.001 | 0.155 | 0.004 |
| education | 0.444 | 0.072 | 0.054 | 0.139 | 0.319 | 0.114 | 0.111 | 0.125 | 0.428 | 0.398 | 0.320 | 0.403 | 0.039 | 1.000 | 0.326 | 0.454 | 0.045 | 0.280 | 0.101 | 0.025 | 0.022 | 0.319 | 0.372 | 0.299 | 0.148 | 0.018 | 0.024 | 0.071 | 0.016 | 0.285 | 0.159 | 0.066 | 0.064 | 0.012 | 0.064 | 0.377 | 0.545 | 0.707 | 0.051 | 0.287 | 0.010 |
| enroll_in_edu_inst_last_wk | 0.430 | 0.022 | 0.029 | 0.019 | 0.082 | 0.037 | 0.038 | 0.034 | 0.205 | 0.339 | 0.127 | 0.159 | 0.003 | 0.326 | 1.000 | 0.153 | 0.014 | 0.074 | 0.021 | 0.011 | 0.018 | 0.133 | 0.116 | 0.199 | 0.027 | 0.019 | 0.010 | 0.025 | 0.020 | 0.075 | 0.066 | 0.023 | 0.078 | 0.022 | 0.015 | 0.066 | 0.177 | 0.103 | 0.024 | 0.185 | 0.008 |
| family_members_under_18 | 0.481 | 0.029 | 0.038 | 0.090 | 0.276 | 0.071 | 0.067 | 0.064 | 0.527 | 0.529 | 0.276 | 0.277 | 0.018 | 0.454 | 0.153 | 1.000 | 0.043 | 0.222 | 0.066 | 0.021 | 0.029 | 0.276 | 0.277 | 0.350 | 0.128 | 0.023 | 0.016 | 0.073 | 0.025 | 0.284 | 0.126 | 0.104 | 0.045 | 0.021 | 0.034 | 0.156 | 0.532 | 0.629 | 0.043 | 0.284 | 0.000 |
| fill_inc_questionnaire_for_veteran's_admin | 0.086 | 0.000 | 0.006 | 0.017 | 0.030 | 0.017 | 0.014 | 0.000 | 0.049 | 0.070 | 0.039 | 0.043 | 0.007 | 0.045 | 0.014 | 0.043 | 1.000 | 0.035 | 0.015 | 0.013 | 0.000 | 0.040 | 0.031 | 0.068 | 0.006 | 0.005 | 0.000 | 0.011 | 0.000 | 0.022 | 0.005 | 0.011 | 0.000 | 0.002 | 0.066 | 0.029 | 0.024 | 0.707 | 0.008 | 0.018 | 0.000 |
| full_or_part_time_employment_stat | 0.317 | 0.029 | 0.032 | 0.043 | 0.363 | 0.058 | 0.055 | 0.043 | 0.181 | 0.217 | 0.361 | 0.361 | 0.018 | 0.280 | 0.074 | 0.222 | 0.035 | 1.000 | 0.032 | 0.019 | 0.551 | 0.361 | 0.360 | 0.191 | 0.153 | 0.448 | 0.551 | 0.456 | 0.162 | 0.310 | 0.135 | 0.022 | 0.074 | 0.132 | 0.102 | 0.153 | 0.279 | 0.306 | 0.055 | 0.326 | 0.790 |
| hispanic_origin | 0.051 | 0.008 | 0.007 | 0.395 | 0.049 | 0.540 | 0.546 | 0.485 | 0.080 | 0.054 | 0.051 | 0.064 | 0.000 | 0.101 | 0.021 | 0.066 | 0.015 | 0.032 | 1.000 | 0.061 | 0.039 | 0.045 | 0.054 | 0.058 | 0.045 | 0.038 | 0.031 | 0.028 | 0.057 | 0.037 | 0.035 | 0.157 | 0.019 | 0.047 | 0.011 | 0.067 | 0.077 | 0.070 | 0.010 | 0.026 | 0.040 |
| instance_weight | 0.009 | 0.010 | 0.007 | 0.055 | 0.023 | 0.057 | 0.056 | 0.046 | 0.037 | 0.035 | 0.026 | 0.022 | 0.010 | 0.025 | 0.011 | 0.021 | 0.013 | 0.019 | 0.061 | 1.000 | 0.031 | 0.020 | 0.021 | 0.021 | 0.015 | 0.032 | 0.026 | 0.017 | 0.035 | 0.042 | 0.024 | 0.087 | 0.011 | 0.033 | 0.033 | 0.011 | 0.046 | 0.027 | 0.023 | 0.028 | 0.026 |
| live_in_this_house_1_year_ago | 0.115 | 0.000 | 0.007 | 0.032 | 0.037 | 0.036 | 0.035 | 0.030 | 0.068 | 0.068 | 0.034 | 0.039 | 0.007 | 0.022 | 0.018 | 0.029 | 0.000 | 0.551 | 0.039 | 0.031 | 1.000 | 0.032 | 0.032 | 0.062 | 0.007 | 0.992 | 0.815 | 1.000 | 0.707 | 0.036 | 0.048 | 0.041 | 0.028 | 0.707 | 0.005 | 0.025 | 0.051 | 0.015 | 0.000 | 0.036 | 0.986 |
| major_industry_code | 0.245 | 0.050 | 0.038 | 0.086 | 0.650 | 0.034 | 0.034 | 0.040 | 0.266 | 0.221 | 0.916 | 0.566 | 0.011 | 0.319 | 0.133 | 0.276 | 0.040 | 0.361 | 0.045 | 0.020 | 0.032 | 1.000 | 0.591 | 0.194 | 0.260 | 0.028 | 0.019 | 0.032 | 0.032 | 0.402 | 0.212 | 0.054 | 0.149 | 0.025 | 0.297 | 0.280 | 0.489 | 0.388 | 0.068 | 0.306 | 0.009 |
| major_occupation_code | 0.245 | 0.062 | 0.040 | 0.099 | 0.548 | 0.049 | 0.048 | 0.057 | 0.268 | 0.224 | 0.599 | 1.000 | 0.010 | 0.372 | 0.116 | 0.277 | 0.031 | 0.360 | 0.054 | 0.021 | 0.032 | 0.591 | 1.000 | 0.197 | 0.245 | 0.027 | 0.019 | 0.033 | 0.033 | 0.378 | 0.213 | 0.055 | 0.144 | 0.025 | 0.332 | 0.366 | 0.493 | 0.387 | 0.069 | 0.304 | 0.009 |
| marital_stat | 0.428 | 0.029 | 0.047 | 0.104 | 0.212 | 0.089 | 0.086 | 0.074 | 0.489 | 0.419 | 0.194 | 0.203 | 0.024 | 0.299 | 0.199 | 0.350 | 0.068 | 0.191 | 0.058 | 0.021 | 0.062 | 0.194 | 0.197 | 1.000 | 0.093 | 0.042 | 0.030 | 0.060 | 0.056 | 0.188 | 0.077 | 0.079 | 0.037 | 0.038 | 0.166 | 0.196 | 0.718 | 0.448 | 0.039 | 0.199 | 0.007 |
| member_of_a_labor_union | 0.171 | 0.015 | 0.026 | 0.012 | 0.277 | 0.041 | 0.040 | 0.028 | 0.107 | 0.129 | 0.260 | 0.261 | 0.000 | 0.148 | 0.027 | 0.128 | 0.006 | 0.153 | 0.045 | 0.015 | 0.007 | 0.260 | 0.245 | 0.093 | 1.000 | 0.008 | 0.006 | 0.021 | 0.005 | 0.226 | 0.073 | 0.025 | 0.041 | 0.007 | 0.030 | 0.072 | 0.165 | 0.126 | 0.350 | 0.221 | 0.008 |
| migration_code_change_in_msa | 0.073 | 0.000 | 0.004 | 0.078 | 0.027 | 0.046 | 0.048 | 0.049 | 0.057 | 0.046 | 0.029 | 0.031 | 0.000 | 0.018 | 0.019 | 0.023 | 0.005 | 0.448 | 0.038 | 0.032 | 0.992 | 0.028 | 0.027 | 0.042 | 0.008 | 1.000 | 0.852 | 0.794 | 0.705 | 0.026 | 0.050 | 0.048 | 0.023 | 0.630 | 0.007 | 0.027 | 0.053 | 0.016 | 0.000 | 0.027 | 0.982 |
| migration_code_change_in_reg | 0.067 | 0.000 | 0.003 | 0.015 | 0.022 | 0.029 | 0.027 | 0.023 | 0.038 | 0.037 | 0.017 | 0.025 | 0.001 | 0.024 | 0.010 | 0.016 | 0.000 | 0.551 | 0.031 | 0.026 | 0.815 | 0.019 | 0.019 | 0.030 | 0.006 | 0.852 | 1.000 | 1.000 | 0.440 | 0.030 | 0.042 | 0.038 | 0.025 | 0.458 | 0.006 | 0.016 | 0.027 | 0.016 | 0.000 | 0.035 | 0.986 |
| migration_code_move_within_reg | 0.085 | 0.000 | 0.011 | 0.075 | 0.050 | 0.033 | 0.034 | 0.037 | 0.084 | 0.062 | 0.033 | 0.035 | 0.000 | 0.071 | 0.025 | 0.073 | 0.011 | 0.456 | 0.028 | 0.017 | 1.000 | 0.032 | 0.033 | 0.060 | 0.021 | 0.794 | 1.000 | 1.000 | 0.738 | 0.042 | 0.058 | 0.041 | 0.024 | 0.708 | 0.007 | 0.037 | 0.094 | 0.109 | 0.000 | 0.036 | 1.000 |
| migration_prev_res_in_sunbelt | 0.108 | 0.004 | 0.008 | 0.033 | 0.035 | 0.049 | 0.052 | 0.043 | 0.065 | 0.063 | 0.036 | 0.039 | 0.003 | 0.016 | 0.020 | 0.025 | 0.000 | 0.162 | 0.057 | 0.035 | 0.707 | 0.032 | 0.033 | 0.056 | 0.005 | 0.705 | 0.440 | 0.738 | 1.000 | 0.029 | 0.046 | 0.025 | 0.027 | 0.867 | 0.001 | 0.024 | 0.048 | 0.002 | 0.000 | 0.036 | 0.290 |
| num_persons_worked_for_employer | 0.225 | 0.114 | 0.094 | 0.042 | 0.510 | 0.043 | 0.041 | 0.037 | 0.258 | 0.229 | 0.405 | 0.395 | 0.151 | 0.285 | 0.075 | 0.284 | 0.022 | 0.310 | 0.037 | 0.042 | 0.036 | 0.402 | 0.378 | 0.188 | 0.226 | 0.026 | 0.030 | 0.042 | 0.029 | 1.000 | 0.223 | 0.047 | 0.060 | 0.021 | 0.109 | 0.237 | 0.522 | 0.406 | 0.224 | 0.878 | 0.034 |
| own_business_or_self_employed | 0.190 | 0.023 | 0.024 | 0.021 | 0.210 | 0.047 | 0.045 | 0.025 | 0.095 | 0.138 | 0.212 | 0.220 | 0.010 | 0.159 | 0.066 | 0.126 | 0.005 | 0.135 | 0.035 | 0.024 | 0.048 | 0.212 | 0.213 | 0.077 | 0.073 | 0.050 | 0.042 | 0.058 | 0.046 | 0.223 | 1.000 | 0.032 | 0.045 | 0.049 | 0.047 | 0.078 | 0.185 | 0.126 | 0.025 | 0.238 | 0.013 |
| race | 0.054 | 0.011 | 0.010 | 0.244 | 0.047 | 0.442 | 0.446 | 0.381 | 0.075 | 0.072 | 0.056 | 0.069 | 0.007 | 0.066 | 0.023 | 0.104 | 0.011 | 0.022 | 0.157 | 0.087 | 0.041 | 0.054 | 0.055 | 0.079 | 0.025 | 0.048 | 0.038 | 0.041 | 0.025 | 0.047 | 0.032 | 1.000 | 0.023 | 0.043 | 0.019 | 0.059 | 0.105 | 0.055 | 0.005 | 0.040 | 0.045 |
| reason_for_unemployment | 0.076 | 0.000 | 0.000 | 0.029 | 0.438 | 0.021 | 0.023 | 0.031 | 0.040 | 0.062 | 0.148 | 0.157 | 0.000 | 0.064 | 0.078 | 0.045 | 0.000 | 0.074 | 0.019 | 0.011 | 0.028 | 0.149 | 0.144 | 0.037 | 0.041 | 0.023 | 0.025 | 0.024 | 0.027 | 0.060 | 0.045 | 0.023 | 1.000 | 0.021 | 0.046 | 0.029 | 0.075 | 0.070 | 0.009 | 0.116 | 0.018 |
| region_of_previous_residence | 0.069 | 0.002 | 0.007 | 0.075 | 0.027 | 0.054 | 0.056 | 0.056 | 0.055 | 0.043 | 0.029 | 0.028 | 0.000 | 0.012 | 0.022 | 0.021 | 0.002 | 0.132 | 0.047 | 0.033 | 0.707 | 0.025 | 0.025 | 0.038 | 0.007 | 0.630 | 0.458 | 0.708 | 0.867 | 0.021 | 0.049 | 0.043 | 0.021 | 1.000 | 0.008 | 0.026 | 0.049 | 0.000 | 0.000 | 0.026 | 0.290 |
| sex | 0.063 | 0.056 | 0.076 | 0.004 | 0.123 | 0.024 | 0.022 | 0.027 | 0.054 | 0.373 | 0.305 | 0.392 | 0.009 | 0.064 | 0.015 | 0.034 | 0.066 | 0.102 | 0.011 | 0.033 | 0.005 | 0.297 | 0.332 | 0.166 | 0.030 | 0.007 | 0.006 | 0.007 | 0.001 | 0.109 | 0.047 | 0.019 | 0.046 | 0.008 | 1.000 | 0.159 | 0.037 | 0.072 | 0.036 | 0.116 | 0.000 |
| target | 0.246 | 0.310 | 0.175 | 0.044 | 0.234 | 0.072 | 0.070 | 0.058 | 0.196 | 0.224 | 0.283 | 0.437 | 0.145 | 0.377 | 0.066 | 0.156 | 0.029 | 0.153 | 0.067 | 0.011 | 0.025 | 0.280 | 0.366 | 0.196 | 0.072 | 0.027 | 0.016 | 0.037 | 0.024 | 0.237 | 0.078 | 0.059 | 0.029 | 0.026 | 0.159 | 1.000 | 0.220 | 0.142 | 0.067 | 0.269 | 0.019 |
| tax_filer_stat | 0.588 | 0.058 | 0.095 | 0.056 | 0.487 | 0.077 | 0.073 | 0.061 | 0.536 | 0.663 | 0.489 | 0.498 | 0.036 | 0.545 | 0.177 | 0.532 | 0.024 | 0.279 | 0.077 | 0.046 | 0.051 | 0.489 | 0.493 | 0.718 | 0.165 | 0.053 | 0.027 | 0.094 | 0.048 | 0.522 | 0.185 | 0.105 | 0.075 | 0.049 | 0.037 | 0.220 | 1.000 | 0.504 | 0.078 | 0.532 | 0.003 |
| veterans_benefits | 0.643 | 0.037 | 0.053 | 0.088 | 0.389 | 0.079 | 0.074 | 0.088 | 0.505 | 0.606 | 0.388 | 0.388 | 0.025 | 0.707 | 0.103 | 0.629 | 0.707 | 0.306 | 0.070 | 0.027 | 0.015 | 0.388 | 0.387 | 0.448 | 0.126 | 0.016 | 0.016 | 0.109 | 0.002 | 0.406 | 0.126 | 0.055 | 0.070 | 0.000 | 0.072 | 0.142 | 0.504 | 1.000 | 0.056 | 0.397 | 0.000 |
| wage_per_hour | 0.037 | 0.008 | 0.011 | 0.017 | 0.083 | 0.000 | 0.000 | 0.000 | 0.052 | 0.036 | 0.069 | 0.079 | -0.001 | 0.051 | 0.024 | 0.043 | 0.008 | 0.055 | 0.010 | 0.023 | 0.000 | 0.068 | 0.069 | 0.039 | 0.350 | 0.000 | 0.000 | 0.000 | 0.000 | 0.224 | 0.025 | 0.005 | 0.009 | 0.000 | 0.036 | 0.067 | 0.078 | 0.056 | 1.000 | 0.216 | 0.000 |
| weeks_worked_in_year | 0.267 | 0.126 | 0.104 | 0.033 | 0.448 | 0.030 | 0.028 | 0.022 | 0.274 | 0.223 | 0.306 | 0.310 | 0.155 | 0.287 | 0.185 | 0.284 | 0.018 | 0.326 | 0.026 | 0.028 | 0.036 | 0.306 | 0.304 | 0.199 | 0.221 | 0.027 | 0.035 | 0.036 | 0.036 | 0.878 | 0.238 | 0.040 | 0.116 | 0.026 | 0.116 | 0.269 | 0.532 | 0.397 | 0.216 | 1.000 | 0.007 |
| year | 0.009 | 0.000 | 0.000 | 0.006 | 0.006 | 0.032 | 0.029 | 0.020 | 0.000 | 0.000 | 0.006 | 0.008 | 0.004 | 0.010 | 0.008 | 0.000 | 0.000 | 0.790 | 0.040 | 0.026 | 0.986 | 0.009 | 0.009 | 0.007 | 0.008 | 0.982 | 0.986 | 1.000 | 0.290 | 0.034 | 0.013 | 0.045 | 0.018 | 0.290 | 0.000 | 0.019 | 0.003 | 0.000 | 0.000 | 0.007 | 1.000 |
Missing values
Sample
| age | class_of_worker | detailed_industry_recode | detailed_occupation_recode | education | wage_per_hour | enroll_in_edu_inst_last_wk | marital_stat | major_industry_code | major_occupation_code | race | hispanic_origin | sex | member_of_a_labor_union | reason_for_unemployment | full_or_part_time_employment_stat | capital_gains | capital_losses | dividends_from_stocks | tax_filer_stat | region_of_previous_residence | state_of_previous_residence | detailed_household_and_family_stat | detailed_household_summary_in_household | instance_weight | migration_code_change_in_msa | migration_code_change_in_reg | migration_code_move_within_reg | live_in_this_house_1_year_ago | migration_prev_res_in_sunbelt | num_persons_worked_for_employer | family_members_under_18 | country_of_birth_father | country_of_birth_mother | country_of_birth_self | citizenship | own_business_or_self_employed | fill_inc_questionnaire_for_veteran's_admin | veterans_benefits | weeks_worked_in_year | year | target | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 38 | Private sector | Transportation | Construction trades | Below High School | 0 | Not in universe | Married | Manufacturing-durable goods | Machine operators assmblrs & inspctrs | White | Mexican (Mexicano) | Female | Not in universe | Not in universe | FTE | 0 | 0 | 0 | Joint Filer | Not in universe | Not in universe | Primary Householder | Spouse of householder | 1032.38 | Not in universe | Not in universe | ? | Not in universe | Not in universe | 4 | Not in universe | Mexico | Mexico | Mexico | Foreign | Not in universe | Not in universe | Not a Veteran | 12 | 1995 | 1 |
| 1 | 44 | Self-employed | Wholesale and retail trade | Other professional specialty occupations | Some College | 0 | Not in universe | Married | Business and repair services | Professional specialty | White | All other | Female | Not in universe | Not in universe | PTE | 0 | 0 | 2500 | Joint Filer | Not in universe | Not in universe | Primary Householder | Spouse of householder | 1462.33 | Not in universe | Not in universe | ? | Not in universe | Not in universe | 1 | Not in universe | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not a Veteran | 26 | 1995 | 1 |
| 2 | 2 | Not in universe | Not in universe or children | Not in universe | Children | 0 | Not in universe | Never Married | Not in universe or children | Not in universe | White | Mexican-American | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Non-Filer | Not in universe | Not in universe | Child | Child under 18 never married | 1601.75 | Not in universe | Not in universe | ? | Not in universe | Not in universe | 0 | Both parents present | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not in universe | 0 | 1995 | 1 |
| 3 | 35 | Private sector | Business and repair services | Management related occupations | High School Graduate | 0 | Not in universe | Divorced | Transportation | Executive admin and managerial | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Individual Filer | Not in universe | Not in universe | Primary Householder | Householder | 1866.88 | No movement | Same area | Nonmover | Yes | Not in universe | 5 | Not in universe | United-States | United-States | United-States | Native | No | Not in universe | Not a Veteran | 52 | 1994 | 1 |
| 4 | 49 | Private sector | Manufacturing-durable goods | Automobile mechanics and repairers | High School Graduate | 0 | Not in universe | Divorced | Construction | Precision production craft & repair | White | All other | Male | Not in universe | Not in universe | FTE | 0 | 0 | 0 | Individual Filer | Not in universe | Not in universe | Other | Nonrelative of householder | 1394.54 | Not in universe | Not in universe | ? | Not in universe | Not in universe | 4 | Not in universe | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not a Veteran | 50 | 1995 | 1 |
| 5 | 13 | Not in universe | Not in universe or children | Not in universe | Children | 0 | Not in universe | Never Married | Not in universe or children | Not in universe | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Non-Filer | Not in universe | Not in universe | Child | Child under 18 never married | 2556.34 | No movement | Same area | Nonmover | Yes | Not in universe | 0 | Both parents present | Germany | United-States | United-States | Native | Not in universe | Not in universe | Not in universe | 0 | 1994 | 1 |
| 6 | 1 | Not in universe | Not in universe or children | Not in universe | Children | 0 | Not in universe | Never Married | Not in universe or children | Not in universe | White | Mexican-American | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Non-Filer | Not in universe | Not in universe | Child | Child under 18 never married | 1723.61 | No movement | Same area | Nonmover | Yes | Not in universe | 0 | Both parents present | Mexico | United-States | United-States | Native | Not in universe | Not in universe | Not in universe | 0 | 1994 | 1 |
| 7 | 61 | Not in universe | Not in universe or children | Not in universe | High School Graduate | 0 | Not in universe | Married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Not Employed | 0 | 0 | 0 | Joint Filer | Not in universe | Not in universe | Primary Householder | Spouse of householder | 1083.03 | Not in universe | Not in universe | ? | Not in universe | Not in universe | 0 | Not in universe | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not a Veteran | 0 | 1995 | 1 |
| 8 | 38 | Private sector | Trade | Other professional specialty occupations | Advanced Degree | 0 | Not in universe | Married | Other professional services | Professional specialty | Black | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Joint Filer | Not in universe | Not in universe | Primary Householder | Householder | 1767.95 | No movement | Same area | Nonmover | Yes | Not in universe | 1 | Not in universe | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not a Veteran | 52 | 1994 | 1 |
| 9 | 7 | Not in universe | Not in universe or children | Not in universe | Children | 0 | Not in universe | Never Married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Non-Filer | Not in universe | Not in universe | Child | Child under 18 never married | 1595.19 | No movement | Same area | Nonmover | Yes | Not in universe | 0 | Both parents present | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not in universe | 0 | 1994 | 1 |
| age | class_of_worker | detailed_industry_recode | detailed_occupation_recode | education | wage_per_hour | enroll_in_edu_inst_last_wk | marital_stat | major_industry_code | major_occupation_code | race | hispanic_origin | sex | member_of_a_labor_union | reason_for_unemployment | full_or_part_time_employment_stat | capital_gains | capital_losses | dividends_from_stocks | tax_filer_stat | region_of_previous_residence | state_of_previous_residence | detailed_household_and_family_stat | detailed_household_summary_in_household | instance_weight | migration_code_change_in_msa | migration_code_change_in_reg | migration_code_move_within_reg | live_in_this_house_1_year_ago | migration_prev_res_in_sunbelt | num_persons_worked_for_employer | family_members_under_18 | country_of_birth_father | country_of_birth_mother | country_of_birth_self | citizenship | own_business_or_self_employed | fill_inc_questionnaire_for_veteran's_admin | veterans_benefits | weeks_worked_in_year | year | target | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 99751 | 22 | Private sector | Manufacturing | Food service occupations | Some College | 0 | College or university | Never Married | Education | Adm support including clerical | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Individual Filer | Not in universe | Not in universe | Primary Householder | Householder | 1164.53 | No movement | Same area | Nonmover | Yes | Not in universe | 6 | Not in universe | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not a Veteran | 52 | 1994 | 1 |
| 99752 | 46 | Private sector | Transport, communications, utilities | Management related occupations | Some College | 0 | Not in universe | Married | Social services | Executive admin and managerial | Black | All other | Female | Not in universe | Not in universe | FTE | 0 | 0 | 0 | Joint Filer | Not in universe | Not in universe | Primary Householder | Spouse of householder | 1197.34 | Not in universe | Not in universe | ? | Not in universe | Not in universe | 1 | Not in universe | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not a Veteran | 52 | 1995 | 1 |
| 99753 | 2 | Not in universe | Not in universe or children | Not in universe | Children | 0 | Not in universe | Never Married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Non-Filer | Not in universe | Not in universe | Child | Child under 18 never married | 1858.67 | Not in universe | Not in universe | ? | Not in universe | Not in universe | 0 | Both parents present | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not in universe | 0 | 1995 | 1 |
| 99754 | 17 | Private sector | Public administration | Personal service occupations | Below High School | 0 | High school | Never Married | Retail trade | Other service | White | All other | Female | Not in universe | Not in universe | FTE | 0 | 0 | 0 | Non-Filer | Not in universe | Not in universe | Child | Child under 18 never married | 1414.11 | Not in universe | Not in universe | ? | Not in universe | Not in universe | 6 | Mother only present | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not a Veteran | 1 | 1995 | 1 |
| 99755 | 20 | Not in universe | Not in universe or children | Not in universe | Some College | 0 | College or university | Never Married | Not in universe or children | Not in universe | White | All other | Male | Not in universe | Not in universe | Not Employed | 0 | 0 | 0 | Non-Filer | Not in universe | Not in universe | Other | Nonrelative of householder | 1544.21 | Not in universe | Not in universe | ? | Not in universe | Not in universe | 0 | Not in universe | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not a Veteran | 0 | 1995 | 1 |
| 99756 | 4 | Not in universe | Not in universe or children | Not in universe | Children | 0 | Not in universe | Never Married | Not in universe or children | Not in universe | White | Mexican-American | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Non-Filer | Not in universe | Not in universe | Child | Child under 18 never married | 1335.91 | Not in universe | Not in universe | ? | Not in universe | Not in universe | 0 | Both parents present | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not in universe | 0 | 1995 | 1 |
| 99758 | 61 | Private sector | Utilities and sanitary services | Construction trades | Below High School | 0 | Not in universe | Separated | Manufacturing-durable goods | Machine operators assmblrs & inspctrs | Black | All other | Male | No | Not in universe | FTE | 0 | 0 | 0 | Individual Filer | Not in universe | Not in universe | Primary Householder | Householder | 2511.11 | Not in universe | Not in universe | ? | Not in universe | Not in universe | 4 | Not in universe | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not a Veteran | 52 | 1995 | 1 |
| 99759 | 24 | Self-employed | Agriculture | Other transportation and material moving | Below High School | 0 | Not in universe | Married | Agriculture | Farming forestry and fishing | White | Mexican (Mexicano) | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Joint Filer | Not in universe | Not in universe | Other | Nonrelative of householder | 2083.76 | No movement | Same area | Nonmover | Yes | Not in universe | 2 | Not in universe | Mexico | Mexico | Mexico | Naturalized | Not in universe | Not in universe | Not a Veteran | 52 | 1994 | 1 |
| 99760 | 30 | Private sector | Trade | Other executive, admin and managerial | College Graduate | 0 | Not in universe | Married | Other professional services | Executive admin and managerial | White | All other | Female | Not in universe | Not in universe | FTE | 0 | 0 | 0 | Joint Filer | Not in universe | Not in universe | Primary Householder | Spouse of householder | 1680.06 | Not in universe | Not in universe | ? | Not in universe | Not in universe | 5 | Not in universe | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not a Veteran | 52 | 1995 | 1 |
| 99761 | 67 | Not in universe | Not in universe or children | Not in universe | Below High School | 0 | Not in universe | Married | Not in universe or children | Not in universe | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Joint Filer | Not in universe | Not in universe | Primary Householder | Householder | 1582.48 | No movement | Same area | Nonmover | Yes | Not in universe | 0 | Not in universe | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not a Veteran | 0 | 1994 | 1 |
Duplicate rows
Most frequently occurring
| age | class_of_worker | detailed_industry_recode | detailed_occupation_recode | education | wage_per_hour | enroll_in_edu_inst_last_wk | marital_stat | major_industry_code | major_occupation_code | race | hispanic_origin | sex | member_of_a_labor_union | reason_for_unemployment | full_or_part_time_employment_stat | capital_gains | capital_losses | dividends_from_stocks | tax_filer_stat | region_of_previous_residence | state_of_previous_residence | detailed_household_and_family_stat | detailed_household_summary_in_household | instance_weight | migration_code_change_in_msa | migration_code_change_in_reg | migration_code_move_within_reg | live_in_this_house_1_year_ago | migration_prev_res_in_sunbelt | num_persons_worked_for_employer | family_members_under_18 | country_of_birth_father | country_of_birth_mother | country_of_birth_self | citizenship | own_business_or_self_employed | fill_inc_questionnaire_for_veteran's_admin | veterans_benefits | weeks_worked_in_year | year | target | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | Not in universe | Not in universe or children | Not in universe | Children | 0 | Not in universe | Never Married | Not in universe or children | Not in universe | Black | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Non-Filer | Not in universe | Not in universe | Extended Family | Other relative of householder | 3556.10 | No movement | Same area | Nonmover | Yes | Not in universe | 0 | Mother only present | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not in universe | 0 | 1994 | 1 | 2 |
| 1 | 5 | Not in universe | Not in universe or children | Not in universe | Children | 0 | Not in universe | Never Married | Not in universe or children | Not in universe | Black | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Non-Filer | Not in universe | Not in universe | Extended Family | Other relative of householder | 538.04 | Not in universe | Not in universe | ? | Not in universe | Not in universe | 0 | Mother only present | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not in universe | 0 | 1995 | 1 | 2 |
| 2 | 5 | Not in universe | Not in universe or children | Not in universe | Children | 0 | Not in universe | Never Married | Not in universe or children | Not in universe | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Non-Filer | Not in universe | Not in universe | Extended Family | Other relative of householder | 1958.46 | No movement | Same area | Nonmover | Yes | Not in universe | 0 | Neither parent present | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not in universe | 0 | 1994 | 1 | 2 |
| 3 | 15 | Not in universe | Not in universe or children | Not in universe | Below High School | 0 | Not in universe | Never Married | Not in universe or children | Not in universe | Black | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Non-Filer | Not in universe | Not in universe | Extended Family | Other relative of householder | 1334.77 | No movement | Same area | Nonmover | Yes | Not in universe | 0 | Mother only present | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not a Veteran | 0 | 1994 | 1 | 2 |
| 4 | 15 | Not in universe | Not in universe or children | Not in universe | Below High School | 0 | Not in universe | Never Married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Non-Filer | Not in universe | Not in universe | Child | Child under 18 never married | 1978.23 | No movement | Same area | Nonmover | Yes | Not in universe | 0 | Both parents present | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not a Veteran | 0 | 1994 | 1 | 2 |
| 5 | 15 | Not in universe | Not in universe or children | Not in universe | Below High School | 0 | Not in universe | Never Married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Non-Filer | Not in universe | Not in universe | Child | Child under 18 never married | 2100.03 | No movement | Same area | Nonmover | Yes | Not in universe | 0 | Both parents present | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not a Veteran | 0 | 1994 | 1 | 2 |
| 6 | 15 | Not in universe | Not in universe or children | Not in universe | Below High School | 0 | Not in universe | Never Married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Not Employed | 0 | 0 | 0 | Non-Filer | Not in universe | Not in universe | Child | Child under 18 never married | 993.45 | Not in universe | Not in universe | ? | Not in universe | Not in universe | 0 | Both parents present | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not a Veteran | 0 | 1995 | 1 | 2 |
| 7 | 15 | Not in universe | Not in universe or children | Not in universe | Below High School | 0 | Not in universe | Never Married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Not Employed | 0 | 0 | 0 | Non-Filer | Not in universe | Not in universe | Child | Child under 18 never married | 1332.77 | Not in universe | Not in universe | ? | Not in universe | Not in universe | 0 | Both parents present | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not a Veteran | 0 | 1995 | 1 | 2 |
| 8 | 15 | Not in universe | Not in universe or children | Not in universe | Below High School | 0 | Not in universe | Never Married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Not Employed | 0 | 0 | 0 | Non-Filer | Not in universe | Not in universe | Child | Child under 18 never married | 2575.48 | Not in universe | Not in universe | ? | Not in universe | Not in universe | 0 | Both parents present | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not a Veteran | 0 | 1995 | 1 | 2 |
| 9 | 15 | Not in universe | Not in universe or children | Not in universe | Below High School | 0 | Not in universe | Never Married | Not in universe or children | Not in universe | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Non-Filer | Not in universe | Not in universe | Child | Child under 18 never married | 1022.09 | No movement | Same area | Nonmover | Yes | Not in universe | 0 | Mother only present | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not a Veteran | 0 | 1994 | 1 | 2 |